ISOLATED HUMAN TRANSPORTER PROTEINS, NUCLEIC ACID MOLECULES
ENCODING HUMAN TRANSPORTER PROTEINS, AND USES THEREOF



FIELD OF THE INVENTION 

The present invention is in the field of transporter proteins that are related to the
Na+-independent transporter subfamily, recombinant DNA molecules, and protein production.
The present invention specifically provides novel peptides and proteins that effect ligand
transport and nucleic acid molecules encoding such peptide and protein molecules, all of
which are useful in the development of human therapeutics and diagnostic compositions and
methods.



BACKGROUND OF THE INVENTION 

Transporters 

Transporter proteins regulate many different functions of a cell, including cell
proliferation, differentiation, and signaling processes, by regulating the flow of molecules,
such as ions and macromolecules, into and out of cells. Transporters are found in the plasma
membranes of virtually every cell in eukaryotic organisms. Transporters mediate a variety of
cellular functions including regulation of membrane potentials and absorption and secretion
of molecules and ions across cell membranes. When present in intracellular membranes of
the Golgi apparatus and endocytic vesicles, transporters, such as chloride channels, also
regulate organelle pH. For a review, see Greger, R. (1988) Annu. Rev. Physiol. 50: 111-122.

Transporters are generally classified by structure and by mode of action. In
addition, transporters are sometimes classified by the molecule type that is transported, for
example, sugar transporters, chloride channels, potassium channels, etc. There may be many
classes of channels for transporting a single type of molecule (a detailed review of channel
types can be found in Alexander, S.P.H. and J.A. Peters: Receptor and transporter
nomenclature supplement. Trends Pharmacol. Sci., Elsevier, pp. 65-68 (1997)).

The following general classification scheme is known in the art and is followed in the 
present discoveries. 

Channel-type transporters. Transmembrane channel proteins of this class are
ubiquitously found in the membranes of all types of organisms from bacteria to higher
eukaryotes. Transport systems of this type catalyze facilitated diffusion (by an
energy-independent process) by passage through a transmembrane aqueous pore or channel without
evidence for a carrier-mediated mechanism. These channel proteins usually consist largely of
α-helical spanners, although β-strands may also be present and may even comprise the
channel. However, outer membrane porin-type channel proteins are excluded from this class
and are instead included in class 9.

Carrier-type transporters. Transport systems are included in this class if they utilize a 
carrier-mediated process to catalyze uniport (a single species is transported by facilitated 
diffusion), antiport (two or more species are transported in opposite directions in a tightly 
coupled process, not coupled to a direct form of energy other than chemiosmotic energy)
and/or symport (two or more species are transported together in the same direction in a 
tightly coupled process, not coupled to a direct form of energy other than chemiosmotic 
energy). 
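
The three carrier-mediated modes defined above can be told apart from the number of transported species and their directions alone. The following is a minimal Python sketch of that distinction; the `CarrierMode` enum, the `classify_carrier` function, and the "in"/"out" encoding are hypothetical names introduced here for illustration and are not part of the classification scheme itself.

```python
from enum import Enum


class CarrierMode(Enum):
    UNIPORT = "uniport"
    ANTIPORT = "antiport"
    SYMPORT = "symport"


def classify_carrier(directions):
    """Classify a carrier-mediated process.

    `directions` holds one entry per transported species, each either
    "in" or "out" (an illustrative encoding, assumed for this sketch).
    """
    if not directions or any(d not in ("in", "out") for d in directions):
        raise ValueError("each direction must be 'in' or 'out'")
    if len(directions) == 1:
        # A single species moved by facilitated diffusion.
        return CarrierMode.UNIPORT
    if len(set(directions)) == 1:
        # Two or more species moved together in the same direction.
        return CarrierMode.SYMPORT
    # Two or more species moved in opposite directions.
    return CarrierMode.ANTIPORT
```

For example, `classify_carrier(["in", "out"])` yields `CarrierMode.ANTIPORT`. The sketch does not model energy coupling, which the definitions above treat as a separate criterion.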

Pyrophosphate bond hydrolysis-driven active transporters. Transport systems are 
included in this class if they hydrolyze pyrophosphate or the terminal pyrophosphate bond in
ATP or another nucleoside triphosphate to drive the active uptake and/or extrusion of a 
solute or solutes. The transport protein may or may not be transiently phosphorylated, but the 
substrate is not phosphorylated. 

PEP-dependent, phosphoryl transfer-driven group translocators. Transport systems of
the bacterial phosphoenolpyruvate:sugar phosphotransferase system are included in this
class. The product of the reaction, derived from extracellular sugar, is a cytoplasmic
sugar-phosphate.

Decarboxylation-driven active transporters. Transport systems that drive solute (e.g., 
ion) uptake or extrusion by decarboxylation of a cytoplasmic substrate are included in this 
class.

Oxidoreduction-driven active transporters. Transport systems that drive transport of a 
solute (e.g., an ion) energized by the flow of electrons from a reduced substrate to an 
oxidized substrate are included in this class. 

Light-driven active transporters. Transport systems that utilize light energy to drive 
transport of a solute (e.g., an ion) are included in this class.

Mechanically-driven active transporters. Transport systems are included in this class 
if they drive movement of a cell or organelle by allowing the flow of ions (or other solutes) 
through the membrane down their electrochemical gradients. 

Outer-membrane porins (of β-structure). These proteins form transmembrane pores or
channels that usually allow the energy-independent passage of solutes across a membrane.
The transmembrane portions of these proteins consist exclusively of β-strands that form a
β-barrel. These porin-type proteins are found in the outer membranes of Gram-negative
bacteria, mitochondria and eukaryotic plastids.

Methyltransferase-driven active transporters. A single characterized protein currently
falls into this category, the Na+-transporting methyltetrahydromethanopterin:coenzyme M
methyltransferase.

Non-ribosome-synthesized channel-forming peptides or peptide-like molecules. 
These molecules, usually chains of L- and D-amino acids as well as other small molecular
building blocks such as lactate, form oligomeric transmembrane ion channels. Voltage may 
induce channel formation by promoting assembly of the transmembrane channel. These 
peptides are often made by bacteria and fungi as agents of biological warfare. 

Non-Proteinaceous Transport Complexes. Ion conducting substances in biological 
membranes that do not consist of or are not derived from proteins or peptides fall into this
category. 

Functionally characterized transporters for which sequence data are lacking. 
Transporters of particular physiological significance will be included in this category even 
though a family assignment cannot be made. 

Putative transporters in which no family member is an established transporter.
Putative transport protein families are grouped under this number and will either be
classified elsewhere when the transport function of a member becomes established, or will be
eliminated from the TC classification system if the proposed transport function is disproven.
These families include a member or members for which a transport function has been
suggested, but evidence for such a function is not yet compelling.

Auxiliary transport proteins. Proteins that in some way facilitate transport across one
or more biological membranes but do not themselves participate directly in transport are
included in this class. These proteins always function in conjunction with one or more
transport proteins. They may provide a function connected with energy coupling to transport,
play a structural role in complex formation or serve a regulatory function.

Transporters of unknown classification. Transport protein families of unknown
classification are grouped under this number and will be classified elsewhere when the
transport process and energy coupling mechanism are characterized. These families include
at least one member for which a transport function has been established, but either the mode
of transport or the energy coupling mechanism is not known.

Ion channels 

An important type of transporter is the ion channel. Ion channels regulate many
different cell proliferation, differentiation, and signaling processes by regulating the flow of
ions into and out of cells. Ion channels are found in the plasma membranes of virtually every
cell in eukaryotic organisms. Ion channels mediate a variety of cellular functions including
regulation of membrane potentials and absorption and secretion of ions across epithelial
membranes. When present in intracellular membranes of the Golgi apparatus and endocytic
vesicles, ion channels, such as chloride channels, also regulate organelle pH. For a review,
see Greger, R. (1988) Annu. Rev. Physiol. 50: 111-122.

Ion channels are generally classified by structure and by mode of action. For
example, extracellular ligand-gated channels (ELGs) are composed of five polypeptide
subunits, with each subunit having 4 membrane-spanning domains, and are activated by the
binding of an extracellular ligand to the channel. In addition, channels are sometimes
classified by the ion type that is transported, for example, chloride channels, potassium
channels, etc. There may be many classes of channels for transporting a single type of ion (a
detailed review of channel types can be found in Alexander, S.P.H. and J.A. Peters (1997),
Receptor and ion channel nomenclature supplement, Trends Pharmacol. Sci., Elsevier, pp.
65-68 and http://www-biology.ucsd.edu/-msaier/transport/toc.html).

There are many types of ion channels based on structure. For example, many ion
channels fall within one of the following groups: extracellular ligand-gated channels (ELG),
intracellular ligand-gated channels (ILG), inward rectifying channels (INR), intercellular
(gap junction) channels, and voltage-gated channels (VIC). Other channel families are
additionally recognized based on the ion type transported, cellular location and drug
sensitivity. Detailed information on each of these, their activity, ligand type, ion type,
disease association, druggability, and other information pertinent to the present invention, is
well known in the art.

Extracellular ligand-gated channels, ELGs, are generally composed of five
polypeptide subunits (Unwin, N. (1993), Cell 72: 31-41; Unwin, N. (1995), Nature 373: 37-43;
Hucho, F., et al., (1996) J. Neurochem. 66: 1781-1792; Hucho, F., et al., (1996) Eur. J.
Biochem. 239: 539-557; Alexander, S.P.H. and J.A. Peters (1997), Trends Pharmacol. Sci.,
Elsevier, pp. 4-6; 36-40; 42-44; and Xue, H. (1998) J. Mol. Evol. 47: 323-333). Each subunit
has 4 membrane-spanning regions; this serves as a means of identifying other members of
the ELG family of proteins. ELGs bind a ligand and in response modulate the flow of ions.
Examples of ELGs include most members of the neurotransmitter-receptor family of proteins,
e.g., GABA-A receptors. Other members of this family of ion channels include glycine
receptors, ryanodine receptors, and ligand-gated calcium channels.

The Voltage-gated Ion Channel (VIC) Superfamily

Proteins of the VIC family are ion-selective channel proteins found in a wide range of
bacteria, archaea and eukaryotes (Hille, B. (1992), Chapter 9: Structure of channel proteins;
Chapter 20: Evolution and diversity. In: Ionic Channels of Excitable Membranes, 2nd Ed.,
Sinauer Assoc. Inc., Pubs., Sunderland, Massachusetts; Sigworth, F.J. (1993), Quart. Rev.
Biophys. 27: 1-40; Salkoff, L. and T. Jegla (1995), Neuron 15: 489-492; Alexander, S.P.H.
et al., (1997), Trends Pharmacol. Sci., Elsevier, pp. 76-84; Jan, L.Y. et al., (1997), Annu.
Rev. Neurosci. 20: 91-123; Doyle, D.A., et al., (1998) Science 280: 69-77; Terlau, H. and W.
Stühmer (1998), Naturwissenschaften 85: 437-444). They are often homo- or
heterooligomeric structures with several dissimilar subunits (e.g., α1-α2-δ-β Ca2+ channels,
αβ1β2 Na+ channels or (α)4-β K+ channels), but the channel and the primary receptor are
usually associated with the α (or α1) subunit. Functionally characterized members are
specific for K+, Na+ or Ca2+. The K+ channels usually consist of homotetrameric structures
with each α-subunit possessing six transmembrane spanners (TMSs). The α1 and α subunits
of the Ca2+ and Na+ channels, respectively, are about four times as large and possess 4 units,
each with 6 TMSs separated by a hydrophilic loop, for a total of 24 TMSs. These large
channel proteins form heterotetra-unit structures equivalent to the homotetrameric structures
of most K+ channels. All four units of the Ca2+ and Na+ channels are homologous to the
single unit in the homotetrameric K+ channels. Ion flux via the eukaryotic channels is
generally controlled by the transmembrane electrical potential (hence the designation,
voltage-sensitive) although some are controlled by ligand or receptor binding.
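
The TMS arithmetic in the preceding paragraph (four units of six TMSs, whether carried on four chains or on one) can be checked with a short Python sketch. The `ChannelArchitecture` class and the two constants are hypothetical names used only for illustration; the unit and TMS counts are those stated in the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ChannelArchitecture:
    name: str
    polypeptides: int            # pore-forming chains per channel
    units_per_polypeptide: int   # repeated 6-TMS units on each chain
    tms_per_unit: int            # transmembrane spanners per unit

    def tms_per_polypeptide(self) -> int:
        return self.units_per_polypeptide * self.tms_per_unit

    def tms_per_channel(self) -> int:
        return self.polypeptides * self.tms_per_polypeptide()


# K+ channel: four separate alpha-subunits, each a single 6-TMS unit.
K_CHANNEL = ChannelArchitecture("K+ homotetramer", 4, 1, 6)

# Ca2+ / Na+ channel: one large subunit carrying four 6-TMS units.
CA_NA_CHANNEL = ChannelArchitecture("Ca2+/Na+ large subunit", 1, 4, 6)
```

Under these counts, `CA_NA_CHANNEL.tms_per_polypeptide()` and `K_CHANNEL.tms_per_channel()` both evaluate to 24, which is the sense in which the four-unit structure is described as equivalent to the homotetramer.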

Several putative K+-selective channel proteins of the VIC family have been identified
in prokaryotes. The structure of one of them, the KcsA K+ channel of Streptomyces lividans,
has been solved to 3.2 Å resolution. The protein possesses four identical subunits, each with
two transmembrane helices, arranged in the shape of an inverted teepee or cone. The cone
cradles the "selectivity filter" P domain in its outer end. The narrow selectivity filter is only
12 Å long, whereas the remainder of the channel is wider and lined with hydrophobic
residues. A large water-filled cavity and helix dipoles stabilize K+ in the pore. The selectivity
filter has two bound K+ ions about 7.5 Å apart from each other. Ion conduction is proposed
to result from a balance of electrostatic attractive and repulsive forces.

In eukaryotes, each VIC family channel type has several subtypes based on
pharmacological and electrophysiological data. Thus, there are five types of Ca2+ channels
(L, N, P, Q and T). There are at least ten types of K+ channels, each responding in different
ways to different stimuli: voltage-sensitive [Ka, Kv, Kvr, Kvs and Ksr], Ca2+-sensitive
[BKCa, KCa and SKCa] and receptor-coupled [KM and KACh]. There are at least six types of
Na+ channels (I, II, III, μ1, H1 and PN3). Tetrameric channels from both prokaryotic and
eukaryotic organisms are known in which each α-subunit possesses 2 TMSs rather than 6,
and these two TMSs are homologous to TMSs 5 and 6 of the six TMS unit found in the
voltage-sensitive channel proteins. KcsA of S. lividans is an example of such a 2 TMS
channel protein. These channels may include the KNa (Na+-activated) and KVol (cell
volume-sensitive) K+ channels, as well as distantly related channels such as the Tok1 K+ channel of
yeast, the TWIK-1 inward rectifier K+ channel of the mouse and the TREK-1 K+ channel of
the mouse. Because of insufficient sequence similarity with proteins of the VIC family,
inward rectifier K+ IRK channels (ATP-regulated; G-protein-activated) which possess a P
domain and two flanking TMSs are placed in a distinct family. However, substantial
sequence similarity in the P region suggests that they are homologous. The β, γ and δ
subunits of VIC family members, when present, frequently play regulatory roles in channel
activation/deactivation.

The Epithelial Na+ Channel (ENaC) Family

The ENaC family consists of over twenty-four sequenced proteins (Canessa, C.M., et
al., (1994), Nature 367: 463-467; Le, T. and M.H. Saier, Jr. (1996), Mol. Membr. Biol. 13:
149-157; Garty, H. and L.G. Palmer (1997), Physiol. Rev. 77: 359-396; Waldmann, R., et
al., (1997), Nature 386: 173-177; Darboux, I., et al., (1998), J. Biol. Chem. 273: 9424-9429;
Firsov, D., et al., (1998), EMBO J. 17: 344-352; Horisberger, J.-D. (1998), Curr. Opin.
Struc. Biol. 10: 443-449). All are from animals with no recognizable homologues in other
eukaryotes or bacteria. The vertebrate ENaC proteins from epithelial cells cluster tightly
together on the phylogenetic tree; voltage-insensitive ENaC homologues are also found in
the brain. Eleven sequenced C. elegans proteins, including the degenerins, are distantly
related to the vertebrate proteins as well as to each other. At least some of these proteins
form part of a mechano-transducing complex for touch sensitivity. The homologous Helix
aspersa (FMRF-amide)-activated Na+ channel is the first peptide neurotransmitter-gated
ionotropic receptor to be sequenced.

Protein members of this family all exhibit the same apparent topology, each with N- 
and C-termini on the inside of the cell, two amphipathic transmembrane spanning segments, 
and a large extracellular loop. The extracellular domains contain numerous highly conserved
cysteine residues. They are proposed to serve a receptor function. 

Mammalian ENaC is important for the maintenance of Na+ balance and the
regulation of blood pressure. Three homologous ENaC subunits, alpha, beta, and gamma,
have been shown to assemble to form the highly Na+-selective channel. The stoichiometry
of the three subunits is alpha2, beta1, gamma1 in a heterotetrameric architecture.

The Glutamate-gated Ion Channel (GIC) Family of Neurotransmitter Receptors 

Members of the GIC family are heteropentameric complexes in which each of the 5
subunits is 800-1000 amino acyl residues in length (Nakanishi, N., et al., (1990), Neuron
5: 569-581; Unwin, N. (1993), Cell 72: 31-41; Alexander, S.P.H. and J.A. Peters (1997)
Trends Pharmacol. Sci., Elsevier, pp. 36-40). These subunits may span the membrane three
or five times as putative α-helices with the N-termini (the glutamate-binding domains)
localized extracellularly and the C-termini localized cytoplasmically. They may be distantly
related to the ligand-gated ion channels, and if so, they may possess substantial β-structure in
their transmembrane regions. However, homology between these two families cannot be
established on the basis of sequence comparisons alone. The subunits fall into six
subfamilies: α, β, γ, δ, ε and ζ.

The GIC channels are divided into three types: (1) α-amino-3-hydroxy-5-methyl-4-isoxazole
propionate (AMPA)-, (2) kainate- and (3) N-methyl-D-aspartate (NMDA)-selective
glutamate receptors. Subunits of the AMPA and kainate classes exhibit 35-40%
identity with each other while subunits of the NMDA receptors exhibit 22-24% identity with
the former subunits. They possess large N-terminal, extracellular glutamate-binding domains
that are homologous to the periplasmic glutamine and glutamate receptors of ABC-type
uptake permeases of Gram-negative bacteria. All known members of the GIC family are
from animals. The different channel (receptor) types exhibit distinct ion selectivities and
conductance properties. The NMDA-selective large conductance channels are highly
permeable to monovalent cations and Ca2+. The AMPA- and kainate-selective ion channels
are permeable primarily to monovalent cations with only low permeability to Ca2+.

The Chloride Channel (ClC) Family

The ClC family is a large family consisting of dozens of sequenced proteins derived
from Gram-negative and Gram-positive bacteria, cyanobacteria, archaea, yeast, plants and
animals (Steinmeyer, K., et al., (1991), Nature 354: 301-304; Uchida, S., et al., (1993), J.
Biol. Chem. 268: 3821-3824; Huang, M.-E., et al., (1994), J. Mol. Biol. 242: 595-598;
Kawasaki, M., et al., (1994), Neuron 12: 597-604; Fisher, W.E., et al., (1995), Genomics
29: 598-606; and Foskett, J.K. (1998), Annu. Rev. Physiol. 60: 689-717). These proteins are
essentially ubiquitous, although they are not encoded within genomes of Haemophilus
influenzae, Mycoplasma genitalium, and Mycoplasma pneumoniae. Sequenced proteins vary
in size from 395 amino acyl residues (M. jannaschii) to 988 residues (man). Several
organisms contain multiple ClC family paralogues. For example, Synechocystis has two
paralogues, one of 451 residues in length and the other of 899 residues. Arabidopsis thaliana
has at least four sequenced paralogues (775-792 residues), humans have at least five
paralogues (820-988 residues), and C. elegans also has at least five (810-950 residues).
There are nine known members in mammals, and mutations in three of the corresponding
genes cause human diseases. E. coli, Methanococcus jannaschii and Saccharomyces
cerevisiae only have one ClC family member each. With the exception of the larger
Synechocystis paralogue, all bacterial proteins are small (395-492 residues) while all
eukaryotic proteins are larger (687-988 residues). These proteins exhibit 10-12 putative
transmembrane α-helical spanners (TMSs) and appear to be present in the membrane as
homodimers. While one member of the family, Torpedo ClC-0, has been reported to have
two channels, one per subunit, others are believed to have just one.

All functionally characterized members of the ClC family transport chloride, some in
a voltage-regulated process. These channels serve a variety of physiological functions (cell
volume regulation; membrane potential stabilization; signal transduction; transepithelial
transport, etc.). Different homologues in humans exhibit differing anion selectivities, i.e.,
ClC4 and ClC5 share a NO3- > Cl- > Br- > I- conductance sequence, while ClC3 has an I- >
Cl- selectivity. The ClC4 and ClC5 channels and others exhibit outward rectifying currents
with currents only at voltages more positive than +20 mV.

Animal Inward Rectifier K+ Channel (IRK-C) Family

IRK channels possess the "minimal channel-forming structure" with only a P domain,
characteristic of the channel proteins of the VIC family, and two flanking transmembrane
spanners (Shuck, M.E., et al., (1994), J. Biol. Chem. 269: 24261-24270; Ashen, M.D., et al.,
(1995), Am. J. Physiol. 268: H506-H511; Salkoff, L. and T. Jegla (1995), Neuron 15:
489-492; Aguilar-Bryan, L., et al., (1998), Physiol. Rev. 78: 227-245; Ruknudin, A., et al.,
(1998), J. Biol. Chem. 273: 14165-14171). They may exist in the membrane as homo- or
heterooligomers. They have a greater tendency to let K+ flow into the cell than out.
Voltage-dependence may be regulated by external K+, by internal Mg2+, by internal ATP and/or by
G-proteins. The P domains of IRK channels exhibit limited sequence similarity to those of
the VIC family, but this sequence similarity is insufficient to establish homology. Inward
rectifiers play a role in setting cellular membrane potentials, and the closing of these
channels upon depolarization permits the occurrence of long duration action potentials with a
plateau phase. Inward rectifiers lack the intrinsic voltage sensing helices found in VIC family
channels. In a few cases, those of Kir1.1a and Kir6.2, for example, direct interaction with a
member of the ABC superfamily has been proposed to confer unique functional and
regulatory properties to the heteromeric complex, including sensitivity to ATP. The SUR1
sulfonylurea receptor (spQ09428) is the ABC protein that regulates the Kir6.2 channel in
response to ATP, and CFTR may regulate Kir1.1a. Mutations in SUR1 are the cause of
familial persistent hyperinsulinemic hypoglycemia in infancy (PHHI), an autosomal
recessive disorder characterized by unregulated insulin secretion in the pancreas.



ATP-gated Cation Channel (ACC) Family

Members of the ACC family (also called P2X receptors) respond to ATP, a
functional neurotransmitter released by exocytosis from many types of neurons (North, R.A.
(1996), Curr. Opin. Cell Biol. 8: 474-483; Soto, F., M. Garcia-Guzman and W. Stühmer
(1997), J. Membr. Biol. 160: 91-100). They have been placed into seven groups (P2X1 -
P2X7) based on their pharmacological properties. These channels, which function at
neuron-neuron and neuron-smooth muscle junctions, may play roles in the control of blood pressure
and pain sensation. They may also function in lymphocyte and platelet physiology. They are
found only in animals.

The proteins of the ACC family are quite similar in sequence (>35% identity), but
they possess 380-1000 amino acyl residues per subunit with variability in length localized
primarily to the C-terminal domains. They possess two transmembrane spanners, one about
30-50 residues from their N-termini, the other near residues 320-340. The extracellular
receptor domains between these two spanners (of about 270 residues) are well conserved
with numerous conserved glycyl and cysteyl residues. The hydrophilic C-termini vary in
length from 25 to 240 residues. They resemble the topologically similar epithelial Na+
channel (ENaC) proteins in possessing (a) N- and C-termini localized intracellularly, (b) two
putative transmembrane spanners, (c) a large extracellular loop domain, and (d) many
conserved extracellular cysteyl residues. ACC family members are, however, not
demonstrably homologous with them. ACC channels are probably hetero- or homomultimers
and transport small monovalent cations (Me+). Some also transport Ca2+; a few also transport
small metabolites.

The Ryanodine-Inositol 1,4,5-triphosphate Receptor Ca2+ Channel (RIR-CaC) Family

Ryanodine (Ry)-sensitive and inositol 1,4,5-triphosphate (IP3)-sensitive Ca2+-release
channels function in the release of Ca2+ from intracellular storage sites in animal cells and
thereby regulate various Ca2+-dependent physiological processes (Hasan, G. et al., (1992)
Development 116: 967-975; Michikawa, T., et al., (1994), J. Biol. Chem. 269: 9184-9189;
Tunwell, R.E.A., (1996), Biochem. J. 318: 477-487; Lee, A.G. (1996) Biomembranes, Vol.
6, Transmembrane Receptors and Channels (A.G. Lee, ed.), JAI Press, Denver, CO., pp
291-326; Mikoshiba, K., et al., (1996) J. Biochem. Biomem. 6: 273-289). Ry receptors occur
primarily in muscle cell sarcoplasmic reticular (SR) membranes, and IP3 receptors occur
primarily in brain cell endoplasmic reticular (ER) membranes where they effect release of
Ca2+ into the cytoplasm upon activation (opening) of the channel.

The Ry receptors are activated as a result of the activity of dihydropyridine-sensitive
Ca2+ channels. The latter are members of the voltage-sensitive ion channel (VIC) family.
Dihydropyridine-sensitive channels are present in the T-tubular systems of muscle tissues.

Ry receptors are homotetrameric complexes with each subunit exhibiting a molecular
size of over 500,000 daltons (about 5,000 amino acyl residues). They possess C-terminal
domains with six putative transmembrane α-helical spanners (TMSs). Putative pore-forming
sequences occur between the fifth and sixth TMSs as suggested for members of the VIC
family. The large N-terminal hydrophilic domains and the small C-terminal hydrophilic
domains are localized to the cytoplasm. Low resolution 3-dimensional structural data are
available. Mammals possess at least three isoforms that probably arose by gene duplication
and divergence before divergence of the mammalian species. Homologues are present in
humans and Caenorhabditis elegans.

IP3 receptors resemble Ry receptors in many respects. (1) They are homotetrameric
complexes with each subunit exhibiting a molecular size of over 300,000 daltons (about
2,700 amino acyl residues). (2) They possess C-terminal channel domains that are
homologous to those of the Ry receptors. (3) The channel domains possess six putative
TMSs and a putative channel lining region between TMSs 5 and 6. (4) Both the large
N-terminal domains and the smaller C-terminal tails face the cytoplasm. (5) They possess
covalently linked carbohydrate on extracytoplasmic loops of the channel domains. (6) They
have three currently recognized isoforms (types 1, 2, and 3) in mammals which are subject to
differential regulation and have different tissue distributions.
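
The subunit sizes quoted for the Ry and IP3 receptors pair a mass in daltons with an approximate residue count. The Python sketch below shows the rough conversion, assuming an average residue mass of about 110 daltons; this figure is a common rule of thumb and is not stated in the text, and `AVG_RESIDUE_MASS_DA` and `estimate_residues` are hypothetical names used only for illustration.

```python
# Assumption: ~110 Da per amino acyl residue (rule-of-thumb average,
# not a value taken from this document).
AVG_RESIDUE_MASS_DA = 110.0


def estimate_residues(mass_da: float) -> int:
    """Approximate the residue count of a polypeptide from its mass."""
    if mass_da <= 0:
        raise ValueError("mass must be positive")
    return round(mass_da / AVG_RESIDUE_MASS_DA)


ry_subunit = estimate_residues(500_000)    # lower bound quoted for Ry receptors
ip3_subunit = estimate_residues(300_000)   # lower bound quoted for IP3 receptors
```

Because both masses are given as "over" the stated value, these estimates are lower bounds, and they are of the same order as the approximate residue counts quoted above.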

IP3 receptors possess three domains: N-terminal IP3-binding domains, central
coupling or regulatory domains and C-terminal channel domains. Channels are activated by
IP3 binding, and like the Ry receptors, the activities of the IP3 receptor channels are regulated
by phosphorylation of the regulatory domains, catalyzed by various protein kinases. They
predominate in the endoplasmic reticular membranes of various cell types in the brain but
have also been found in the plasma membranes of some nerve cells derived from a variety of
tissues.



The channel domains of the Ry and IP3 receptors comprise a coherent family that, in
spite of apparent structural similarities, does not show appreciable sequence similarity to the
proteins of the VIC family. The Ry receptors and the IP3 receptors cluster separately on the
RIR-CaC family tree. They both have homologues in Drosophila. Based on the phylogenetic
tree for the family, the family probably evolved in the following sequence: (1) A gene
duplication event occurred that gave rise to Ry and IP3 receptors in invertebrates. (2)
Vertebrates evolved from invertebrates. (3) The three isoforms of each receptor arose as a
result of two distinct gene duplication events. (4) These isoforms were transmitted to
mammals before divergence of the mammalian species.

The Organellar Chloride Channel (O-ClC) Family

Proteins of the O-ClC family are voltage-sensitive chloride channels found in
intracellular membranes but not the plasma membranes of animal cells (Landry, D., et al.,
(1993), J. Biol. Chem. 268: 14948-14955; Valenzuela, S., et al., (1997), J. Biol. Chem. 272:
12575-12582; and Duncan, R.R., et al., (1997), J. Biol. Chem. 272: 23880-23886).

They are found in human nuclear membranes, and the bovine protein targets to the
microsomes, but not the plasma membrane, when expressed in Xenopus laevis oocytes.
These proteins are thought to function in the regulation of the membrane potential and in
transepithelial ion absorption and secretion in the kidney. They possess two putative
transmembrane α-helical spanners (TMSs) with cytoplasmic N- and C-termini and a large
luminal loop that may be glycosylated. The bovine protein is 437 amino acyl residues in
length and has the two putative TMSs at positions 223-239 and 367-385. The human nuclear
protein is much smaller (241 residues). A C. elegans homologue is 260 residues long.

The present invention has substantial similarity to the rat small intestine
Na+-independent transporter for aromatic amino acids that is designated as TAT1 (T-type amino
acid transporter 1).

System T was originally characterized in human erythrocytes. It transports aromatic
amino acids in a Na+-independent manner. Although it was once proposed that system T is a
variant of system L, which shows Na+-independent transport of neutral amino acids including
aromatic amino acids, system T is distinct in that it accepts N-methyl amino acids whereas
system L does not. Therefore, it is reasonable to assume that transporters subserving system
T would belong to a different family with distinct mechanisms of substrate recognition.

The Na+-independent transporter mediates Na+-independent, low-affinity transport of
aromatic amino acids such as tryptophan, tyrosine, and phenylalanine (Km values:
approximately 5 mM), consistent with the properties of classical amino acid transport system
T. TAT1 accepts some variations of aromatic side chains, because it interacts with amino
acid-related compounds such as L-DOPA and 3-O-methyl-DOPA. TAT1 recognizes amino
acid substrates as anions, because it accepts N-methyl and N-acetyl derivatives of
aromatic amino acids but does not accept their methyl esters. Consistent with this, TAT1
exhibits sequence similarity (approximately 30% identity at the amino acid level) to
H+/monocarboxylate transporters. Unlike H+/monocarboxylate transporters,
however, TAT1 is not coupled to H+ transport; it mediates electroneutral
facilitated diffusion. In rat small intestine, TAT1 immunoreactivity was detected in the
basolateral membrane of the epithelial cells, suggesting a role in the transepithelial transport
of aromatic amino acids. For a further review of the Na+-independent transporter, see Kim et al.,
J Biol Chem 2001 May 18;276(20):17221-8.
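
To put the reported Km of approximately 5 mM in context, the following minimal sketch
evaluates a simple saturable (Michaelis-Menten) rate law. It assumes that uptake follows
this rate law; the function name transport_rate and the normalized vmax value are
illustrative assumptions and are not taken from the cited reference.

```python
def transport_rate(substrate_mM: float, km_mM: float = 5.0, vmax: float = 1.0) -> float:
    """Return the uptake rate under an assumed Michaelis-Menten rate law.

    km_mM defaults to the approximate Km quoted in the text (5 mM);
    vmax is an arbitrary normalized maximum rate (assumption).
    """
    if substrate_mM < 0:
        raise ValueError("substrate concentration must be non-negative")
    # v = Vmax * [S] / (Km + [S])
    return vmax * substrate_mM / (km_mM + substrate_mM)


if __name__ == "__main__":
    # At [S] equal to Km the rate is half of vmax by definition of Km.
    for conc in (0.5, 5.0, 50.0):
        print(conc, round(transport_rate(conc), 3))
```

Under this assumption, a low-affinity transporter operates well below saturation at
sub-millimolar substrate concentrations.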

Transporter proteins, particularly members of the Na+-independent transporter
subfamily, are a major target for drug action and development. Accordingly, it is valuable to
the field of pharmaceutical development to identify and characterize previously unknown
transport proteins. The present invention advances the state of the art by providing previously
unidentified human transport proteins.

SUMMARY OF THE INVENTION

The present invention is based in part on the identification of amino acid sequences
of human transporter peptides and proteins that are related to the Na+-independent
transporter subfamily, as well as allelic variants and other mammalian orthologs thereof.
These unique peptide sequences, and nucleic acid sequences that encode these peptides, can
be used as models for the development of human therapeutic targets, aid in the identification
of therapeutic proteins, and serve as targets for the development of human therapeutic agents
that modulate transporter activity in cells and tissues that express the transporter.
Experimental data as provided in Figure 1 indicate expression in humans in organs such
as lung, brain, and prostate, as well as in different tissues.

DESCRIPTION OF THE FIGURE SHEETS 

FIGURE 1 provides the nucleotide sequence of a cDNA molecule or transcript
sequence that encodes the transporter protein of the present invention (SEQ ID NO:1). In
addition, structural and functional information is provided, such as ATG start, stop, and tissue
distribution, where available, that allows one to readily determine specific uses of inventions
based on this molecular sequence. Experimental data as provided in Figure 1 indicate
expression in humans in organs such as lung, brain, and prostate, as well as in
different tissues.

FIGURE 2 provides the predicted amino acid sequence of the transporter of the
present invention (SEQ ID NO:2). In addition, structural and functional information such as
protein family, function, and modification sites is provided where available, allowing one to
readily determine specific uses of inventions based on this molecular sequence.

FIGURE 3 provides genomic sequences that span the gene encoding the transporter
protein of the present invention (SEQ ID NO:3). In addition, structural and functional
information, such as intron/exon structure, promoter location, etc., is provided where
available, allowing one to readily determine specific uses of inventions based on this
molecular sequence. 94 SNPs, including 10 indels, have been identified in the gene
encoding the transporter protein provided by the present invention and are given in Figure 3.

DETAILED DESCRIPTION OF THE INVENTION

General Description 

The present invention is based on the sequencing of the human genome. During the
sequencing and assembly of the human genome, analysis of the sequence information
revealed previously unidentified fragments of the human genome that encode peptides that
share structural and/or sequence homology to protein/peptide/domains identified and
characterized within the art as being a transporter protein or part of a transporter protein and
are related to the Na+-independent transporter subfamily. Utilizing these sequences,
additional genomic sequences were assembled and transcript and/or cDNA sequences were
isolated and characterized. Based on this analysis, the present invention provides amino acid
sequences of human transporter peptides and proteins that are related to the Na+-independent
transporter subfamily, nucleic acid sequences in the form of transcript sequences, cDNA
sequences and/or genomic sequences that encode these transporter peptides and proteins,
nucleic acid variation (allelic information), tissue distribution of expression, and information
about the closest art-known protein/peptide/domain that has structural or sequence homology
to the transporter of the present invention.

In addition to being previously unknown, the peptides that are provided in the present
invention are selected based on their ability to be used for the development of commercially
important products and services. Specifically, the present peptides are selected based on
homology and/or structural relatedness to known transporter proteins of the
Na+-independent transporter subfamily and the expression pattern observed. Experimental data
as provided in Figure 1 indicate expression in humans in organs such as lung, brain, and
prostate, as well as in different tissues. The art has clearly established the commercial
importance of members of this family of proteins and proteins that have expression patterns
similar to that of the present gene. Some of the more specific features of the peptides of the
present invention, and the uses thereof, are described herein, particularly in the Background
of the Invention and in the annotation provided in the Figures, and/or are known within the
art for each of the known Na+-independent transporter family or subfamily of transporter
proteins.

Specific Embodiments
Peptide Molecules

The present invention provides nucleic acid sequences that encode protein molecules
that have been identified as being members of the transporter family of proteins and are
related to the Na+-independent transporter subfamily (protein sequences are provided in
Figure 2, transcript/cDNA sequences are provided in Figure 1, and genomic sequences are
provided in Figure 3). The peptide sequences provided in Figure 2, as well as the obvious
variants described herein, particularly allelic variants as identified herein and using the
information in Figure 3, will be referred to herein as the transporter peptides of the present
invention, transporter peptides, or peptides/proteins of the present invention.

The present invention provides isolated peptide and protein molecules that consist of,
consist essentially of, or comprise the amino acid sequences of the transporter peptides
disclosed in Figure 2 (encoded by the nucleic acid molecule shown in Figure 1,
transcript/cDNA, or Figure 3, genomic sequence), as well as all obvious variants of these
peptides that are within the art to make and use. Some of these variants are described in
detail below.

As used herein, a peptide is said to be "isolated" or "purified" when it is substantially
free of cellular material or free of chemical precursors or other chemicals. The peptides of the
present invention can be purified to homogeneity or other degrees of purity. The level of
purification will be based on the intended use. The critical feature is that the preparation allows
for the desired function of the peptide, even if in the presence of considerable amounts of other
components (the features of an isolated nucleic acid molecule are discussed below).

In some uses, "substantially free of cellular material" includes preparations of the
peptide having less than about 30% (by dry weight) other proteins (i.e., contaminating protein),
less than about 20% other proteins, less than about 10% other proteins, or less than about 5%
other proteins. When the peptide is recombinantly produced, it can also be substantially free of
culture medium, i.e., culture medium represents less than about 20% of the volume of the
protein preparation.

The language "substantially free of chemical precursors or other chemicals" includes
preparations of the peptide in which it is separated from chemical precursors or other chemicals
that are involved in its synthesis. In one embodiment, the language "substantially free of
chemical precursors or other chemicals" includes preparations of the transporter peptide having
less than about 30% (by dry weight) chemical precursors or other chemicals, less than about
20% chemical precursors or other chemicals, less than about 10% chemical precursors or other
chemicals, or less than about 5% chemical precursors or other chemicals.

The isolated transporter peptide can be purified from cells that naturally express it,
purified from cells that have been altered to express it (recombinant), or synthesized using
known protein synthesis methods. Experimental data as provided in Figure 1 indicate
expression in humans in organs such as lung, brain, and prostate, as well as in different
tissues. For example, a nucleic acid molecule encoding the transporter peptide is cloned into an
expression vector, the expression vector is introduced into a host cell, and the protein is
expressed in the host cell. The protein can then be isolated from the cells by an appropriate
purification scheme using standard protein purification techniques. Many of these techniques
are described in detail below.

Accordingly, the present invention provides proteins that consist of the amino acid
sequences provided in Figure 2 (SEQ ID NO:2), for example, proteins encoded by the
transcript/cDNA nucleic acid sequences shown in Figure 1 (SEQ ID NO:1) and the genomic
sequences provided in Figure 3 (SEQ ID NO:3). The amino acid sequence of such a protein is
provided in Figure 2. A protein consists of an amino acid sequence when the amino acid
sequence is the final amino acid sequence of the protein.

The present invention further provides proteins that consist essentially of the amino acid
sequences provided in Figure 2 (SEQ ID NO:2), for example, proteins encoded by the
transcript/cDNA nucleic acid sequences shown in Figure 1 (SEQ ID NO:1) and the genomic
sequences provided in Figure 3 (SEQ ID NO:3). A protein consists essentially of an amino acid
sequence when such an amino acid sequence is present with only a few additional amino acid
residues, for example from about 1 to about 100 or so additional residues, typically from 1 to
about 20 additional residues in the final protein.

The present invention further provides proteins that comprise the amino acid sequences
provided in Figure 2 (SEQ ID NO:2), for example, proteins encoded by the transcript/cDNA
nucleic acid sequences shown in Figure 1 (SEQ ID NO:1) and the genomic sequences provided
in Figure 3 (SEQ ID NO:3). A protein comprises an amino acid sequence when the amino acid
sequence is at least part of the final amino acid sequence of the protein. In such a fashion, the
protein can be only the peptide or have additional amino acid molecules, such as amino acid
residues (contiguous encoded sequence) that are naturally associated with it or heterologous
amino acid residues/peptide sequences. Such a protein can have a few additional amino acid
residues or can comprise several hundred or more additional amino acids. The preferred classes
of proteins that are comprised of the transporter peptides of the present invention are the
naturally occurring mature proteins. A brief description of how various types of these proteins
can be made/isolated is provided below.
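
The three relationships defined above ("consists of", "consists essentially of", and
"comprises") can be illustrated with a short sketch. The function name
classify_relation, the toy sequences, and the use of 100 additional residues as a hard
cutoff are assumptions made only for illustration; the text itself gives "about 1 to
about 100 or so" as an example range, not a fixed limit.

```python
def classify_relation(protein: str, reference: str, max_extra: int = 100) -> str:
    """Return the narrowest of the three relationships that applies.

    protein   -- final amino acid sequence of the protein (one-letter codes)
    reference -- reference amino acid sequence (e.g. a sequence such as SEQ ID NO:2)
    max_extra -- illustrative cutoff for "only a few additional residues" (assumption)
    """
    if protein == reference:
        return "consists of"
    if reference in protein:
        extra = len(protein) - len(reference)
        if extra <= max_extra:
            return "consists essentially of"
        return "comprises"
    return "none"


if __name__ == "__main__":
    ref = "MKTLLVA"                      # hypothetical toy sequence
    print(classify_relation(ref, ref))
    print(classify_relation("GS" + ref, ref))
    print(classify_relation(ref + "A" * 300, ref))
```

Note that a protein that consists of, or consists essentially of, a sequence also
comprises it; the sketch reports only the narrowest category.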

The transporter peptides of the present invention can be attached to heterologous
sequences to form chimeric or fusion proteins. Such chimeric and fusion proteins comprise a
transporter peptide operatively linked to a heterologous protein having an amino acid sequence
not substantially homologous to the transporter peptide. "Operatively linked" indicates that the
transporter peptide and the heterologous protein are fused in-frame. The heterologous protein
can be fused to the N-terminus or C-terminus of the transporter peptide.

In some uses, the fusion protein does not affect the activity of the transporter peptide per
se. For example, the fusion protein can include, but is not limited to, enzymatic fusion proteins,
for example beta-galactosidase fusions, yeast two-hybrid GAL fusions, poly-His fusions,
MYC-tagged, HI-tagged and Ig fusions. Such fusion proteins, particularly poly-His fusions, can
facilitate the purification of recombinant transporter peptide. In certain host cells (e.g.,
mammalian host cells), expression and/or secretion of a protein can be increased by using a
heterologous signal sequence.

A chimeric or fusion protein can be produced by standard recombinant DNA
techniques. For example, DNA fragments coding for the different protein sequences are ligated
together in-frame in accordance with conventional techniques. In another embodiment, the
fusion gene can be synthesized by conventional techniques including automated DNA
synthesizers. Alternatively, PCR amplification of gene fragments can be carried out using
anchor primers which give rise to complementary overhangs between two consecutive gene
fragments which can subsequently be annealed and re-amplified to generate a chimeric gene
sequence (see Ausubel et al., Current Protocols in Molecular Biology, 1992). Moreover, many
expression vectors are commercially available that already encode a fusion moiety (e.g., a GST
protein). A transporter peptide-encoding nucleic acid can be cloned into such an expression
vector such that the fusion moiety is linked in-frame to the transporter peptide.
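
The requirement that two coding sequences be joined "in-frame" can be checked
computationally. The following is a minimal sketch, assuming that both inputs are DNA
coding sequences that start at the first base of a codon; the function name
fuse_in_frame and the example sequences are hypothetical and are not part of any cloning
protocol referenced above.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def fuse_in_frame(upstream_orf: str, downstream_orf: str) -> str:
    """Join two coding sequences so the downstream one stays in frame.

    The terminal stop codon of the upstream sequence, if present, is removed so
    that translation continues into the downstream sequence.
    """
    up = upstream_orf.upper()
    down = downstream_orf.upper()
    if len(up) % 3 != 0 or len(down) % 3 != 0:
        raise ValueError("both sequences must contain a whole number of codons")
    if up[-3:] in STOP_CODONS:
        up = up[:-3]
    # An internal stop codon in the upstream part would truncate the fusion.
    codons = [up[i:i + 3] for i in range(0, len(up), 3)]
    if any(codon in STOP_CODONS for codon in codons):
        raise ValueError("upstream sequence contains an internal stop codon")
    return up + down


if __name__ == "__main__":
    tag = "ATGCACCACTAA"        # hypothetical short tag ORF ending in a stop codon
    target = "ATGGCTTGA"        # hypothetical target ORF
    print(fuse_in_frame(tag, target))
```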

As mentioned above, the present invention also provides and enables obvious variants
of the amino acid sequence of the proteins of the present invention, such as naturally occurring
mature forms of the peptide, allelic/sequence variants of the peptides, non-naturally occurring
recombinantly derived variants of the peptides, and orthologs and paralogs of the peptides.
Such variants can readily be generated using art-known techniques in the fields of recombinant
nucleic acid technology and protein biochemistry. It is understood, however, that variants
exclude any amino acid sequences disclosed prior to the invention.

Such variants can readily be identified/made using molecular techniques and the
sequence information disclosed herein. Further, such variants can readily be distinguished
from other peptides based on sequence and/or structural homology to the transporter peptides of
the present invention. The degree of homology/identity present will be based primarily on
whether the peptide is a functional variant or non-functional variant, the amount of divergence
present in the paralog family and the evolutionary distance between the orthologs.

To determine the percent identity of two amino acid sequences or two nucleic acid
sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps can be
introduced in one or both of a first and a second amino acid or nucleic acid sequence for
optimal alignment and non-homologous sequences can be disregarded for comparison
purposes). In a preferred embodiment, at least 30%, 40%, 50%, 60%, 70%, 80%, or 90% or
more of a reference sequence is aligned for comparison purposes. The amino acid residues
or nucleotides at corresponding amino acid positions or nucleotide positions are then
compared. When a position in the first sequence is occupied by the same amino acid residue
or nucleotide as the corresponding position in the second sequence, then the molecules are
identical at that position (as used herein amino acid or nucleic acid "identity" is equivalent to
amino acid or nucleic acid "homology"). The percent identity between the two sequences is
a function of the number of identical positions shared by the sequences, taking into account
the number of gaps, and the length of each gap, which need to be introduced for optimal
alignment of the two sequences.
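
The position-by-position comparison described above can be sketched as follows. The
text does not fix a single formula, so this example assumes one common convention:
identical positions divided by the number of alignment columns, with gap columns counted
in the denominator. The function name percent_identity and the "-" gap symbol are
illustrative assumptions.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity of two already-aligned sequences of equal length.

    Assumed convention: identical columns / total alignment columns * 100,
    where columns that are gaps in both sequences are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == gap and b == gap)]
    if not columns:
        raise ValueError("alignment has no comparable columns")
    identical = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * identical / len(columns)


if __name__ == "__main__":
    # Five identical columns out of six; the single gap lowers the score.
    print(round(percent_identity("ACDEFG", "ACD-FG"), 1))
```

Other conventions (for example, dividing by the length of the shorter sequence) give
different values for the same alignment, which is why the convention should be stated
whenever a percent identity is reported.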

The comparison of sequences and determination of percent identity and similarity
between two sequences can be accomplished using a mathematical algorithm
(Computational Molecular Biology, Lesk, A.M., ed., Oxford University Press, New York,
1988; Biocomputing: Informatics and Genome Projects, Smith, D.W., ed., Academic Press,
New York, 1993; Computer Analysis of Sequence Data, Part 1, Griffin, A.M., and Griffin,
H.G., eds., Humana Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von
Heijne, G., Academic Press, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux,
J., eds., M Stockton Press, New York, 1991). In a preferred embodiment, the percent identity
between two amino acid sequences is determined using the Needleman and Wunsch (J. Mol.
Biol. (48):444-453 (1970)) algorithm which has been incorporated into the GAP program in
the GCG software package (available at http://www.gcg.com), using either a BLOSUM 62
matrix or a PAM250 matrix, and a gap weight of 16, 14, 12, 10, 8, 6, or 4 and a length
weight of 1, 2, 3, 4, 5, or 6. In yet another preferred embodiment, the percent identity
between two nucleotide sequences is determined using the GAP program in the GCG
software package (Devereux, J., et al., Nucleic Acids Res. 12(1):387 (1984)) (available at
http://www.gcg.com), using a NWSgapdna.CMP matrix and a gap weight of 40, 50, 60, 70,
or 80 and a length weight of 1, 2, 3, 4, 5, or 6. In another embodiment, the percent identity
between two amino acid or nucleotide sequences is determined using the algorithm of E.
Myers and W. Miller (CABIOS, 4:11-17 (1989)) which has been incorporated into the
ALIGN program (version 2.0), using a PAM120 weight residue table, a gap length penalty of
12 and a gap penalty of 4.
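
For orientation, the dynamic-programming idea behind Needleman-Wunsch global alignment
can be sketched in a few lines. This is a simplified illustration and not the GAP
program: it assumes a flat match/mismatch score instead of a BLOSUM 62 or PAM250 matrix,
and a linear gap penalty instead of separate gap and length weights. The function name
and default scores are assumptions.

```python
def needleman_wunsch_score(a: str, b: str, match: int = 1,
                           mismatch: int = -1, gap: int = -2) -> int:
    """Optimal global alignment score of a and b under a simple scoring scheme."""
    # prev[j] holds the best score for aligning the processed prefix of a with b[:j].
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]  # aligning a[:i] against an empty prefix costs i gaps
        for j in range(1, len(b) + 1):
            diagonal = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            up = prev[j] + gap        # gap introduced in b
            left = curr[j - 1] + gap  # gap introduced in a
            curr.append(max(diagonal, up, left))
        prev = curr
    return prev[-1]


if __name__ == "__main__":
    print(needleman_wunsch_score("GATTACA", "GATTACA"))
    print(needleman_wunsch_score("ACGT", "ACT"))
```

Replacing the flat match/mismatch score with a lookup in a substitution matrix, and the
linear gap term with an opening weight plus a per-residue length weight, would bring the
sketch closer to the parameterization described in the text.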

The nucleic acid and protein sequences of the present invention can further be used as
a "query sequence" to perform a search against sequence databases to, for example, identify
other family members or related sequences. Such searches can be performed using the
NBLAST and XBLAST programs (version 2.0) of Altschul, et al. (J. Mol. Biol. 215:403-10
(1990)). BLAST nucleotide searches can be performed with the NBLAST program, score =
100, wordlength = 12 to obtain nucleotide sequences homologous to the nucleic acid
molecules of the invention. BLAST protein searches can be performed with the XBLAST
program, score = 50, wordlength = 3 to obtain amino acid sequences homologous to the
proteins of the invention. To obtain gapped alignments for comparison purposes, Gapped
BLAST can be utilized as described in Altschul et al. (Nucleic Acids Res. 25(17):3389-3402
(1997)). When utilizing BLAST and gapped BLAST programs, the default parameters of the
respective programs (e.g., XBLAST and NBLAST) can be used.

Full-length pre-processed forms, as well as mature processed forms, of proteins that
comprise one of the peptides of the present invention can readily be identified as having
complete sequence identity to one of the transporter peptides of the present invention as well as
being encoded by the same genetic locus as the transporter peptide provided herein. The gene
encoding the novel transporter protein of the present invention is located on a genome
component that has been mapped to human chromosome 6 (as indicated in Figure 3), which is
supported by multiple lines of evidence, such as STS and BAC map data.

Allelic variants of a transporter peptide can readily be identified as being a human
protein having a high degree (significant) of sequence homology/identity to at least a portion of
the transporter peptide as well as being encoded by the same genetic locus as the transporter
peptide provided herein. Genetic locus can readily be determined based on the genomic
information provided in Figure 3, such as the genomic sequence mapped to the reference
human. The gene encoding the novel transporter protein of the present invention is located on a
genome component that has been mapped to human chromosome 6 (as indicated in Figure 3),
which is supported by multiple lines of evidence, such as STS and BAC map data. As used
herein, two proteins (or a region of the proteins) have significant homology when the amino
acid sequences are typically at least about 70-80%, 80-90%, and more typically at least about
90-95% or more homologous. A significantly homologous amino acid sequence, according
to the present invention, will be encoded by a nucleic acid sequence that will hybridize to a
transporter peptide encoding nucleic acid molecule under stringent conditions as more fully
described below.

Figure 3 provides information on SNPs that have been identified in a gene encoding
the transporter protein of the present invention. 94 SNP variants were found, including 10
indels (indicated by a "-") and 1 SNP in exons. SNPs, identified at different nucleotide
positions in introns and regions 5' and 3' of the ORF, may affect control/regulatory
elements.

Paralogs of a transporter peptide can readily be identified as having some degree of
significant sequence homology/identity to at least a portion of the transporter peptide, as being
encoded by a gene from humans, and as having similar activity or function. Two proteins will
typically be considered paralogs when the amino acid sequences are at least about
60% or greater, and more typically at least about 70% or greater, homologous through a given
region or domain. Such paralogs will be encoded by a nucleic acid sequence that will
hybridize to a transporter peptide encoding nucleic acid molecule under moderate to
stringent conditions as more fully described below.

Orthologs of a transporter peptide can readily be identified as having some degree of
significant sequence homology/identity to at least a portion of the transporter peptide as well as
being encoded by a gene from another organism. Preferred orthologs will be isolated from
mammals, preferably primates, for the development of human therapeutic targets and agents.
Such orthologs will be encoded by a nucleic acid sequence that will hybridize to a transporter
peptide encoding nucleic acid molecule under moderate to stringent conditions, as more fully
described below, depending on the degree of relatedness of the two organisms yielding the
proteins.

Non-naturally occurring variants of the transporter peptides of the present invention can
readily be generated using recombinant techniques. Such variants include, but are not limited to,
deletions, additions and substitutions in the amino acid sequence of the transporter peptide. For
example, one class of substitutions is conservative amino acid substitutions. Such substitutions
are those that substitute a given amino acid in a transporter peptide by another amino acid of
like characteristics. Typically seen as conservative substitutions are the replacements, one for
another, among the aliphatic amino acids Ala, Val, Leu, and Ile; interchange of the hydroxyl
residues Ser and Thr; exchange of the acidic residues Asp and Glu; substitution between the
amide residues Asn and Gln; exchange of the basic residues Lys and Arg; and replacements
among the aromatic residues Phe and Tyr. Guidance concerning which amino acid changes are
likely to be phenotypically silent is found in Bowie et al., Science 247:1306-1310 (1990).
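
The conservative substitution classes listed above can be encoded directly as sets. The
sketch below uses exactly the six groups named in the text; the function name
is_conservative and the three-letter residue codes as input format are assumptions made
for illustration.

```python
# Groups taken from the text: aliphatic, hydroxyl, acidic, amide, basic, aromatic.
CONSERVATIVE_GROUPS = [
    {"Ala", "Val", "Leu", "Ile"},
    {"Ser", "Thr"},
    {"Asp", "Glu"},
    {"Asn", "Gln"},
    {"Lys", "Arg"},
    {"Phe", "Tyr"},
]


def is_conservative(original: str, replacement: str) -> bool:
    """True if replacing `original` by `replacement` stays within one listed group."""
    if original == replacement:
        return False  # not a substitution at all
    return any(original in group and replacement in group
               for group in CONSERVATIVE_GROUPS)


if __name__ == "__main__":
    print(is_conservative("Leu", "Ile"))
    print(is_conservative("Asp", "Lys"))
```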

Variant transporter peptides can be fully functional or can lack function in one or more
activities, e.g. ability to bind ligand, ability to transport ligand, ability to mediate signaling, etc.
Fully functional variants typically contain only conservative variation or variation in
non-critical residues or in non-critical regions. Figure 2 provides the result of protein analysis and
can be used to identify critical domains/regions. Functional variants can also contain
substitution of similar amino acids that result in no change or an insignificant change in
function. Alternatively, such substitutions may positively or negatively affect function to some
degree.

Non-functional variants typically contain one or more non-conservative amino acid
substitutions, deletions, insertions, inversions, or truncation or a substitution, insertion,
inversion, or deletion in a critical residue or critical region.

Amino acids that are essential for function can be identified by methods known in the
art, such as site-directed mutagenesis or alanine-scanning mutagenesis (Cunningham et al.,
Science 244:1081-1085 (1989)), particularly using the results provided in Figure 2. The latter
procedure introduces single alanine mutations at every residue in the molecule. The resulting
mutant molecules are then tested for biological activity such as transporter activity or in assays
such as an in vitro proliferative activity. Sites that are critical for binding partner/substrate
binding can also be determined by structural analysis such as crystallization, nuclear magnetic
resonance or photoaffinity labeling (Smith et al., J. Mol. Biol. 224:899-904 (1992); de Vos et al.,
Science 255:306-312 (1992)).
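
The design step of alanine-scanning mutagenesis, in which each residue is replaced by
alanine one at a time, can be sketched as follows. The function name alanine_scan, the
one-letter input format, and the choice to skip positions that are already alanine are
assumptions for illustration; the sketch only enumerates sequences and says nothing about
how the mutants would be made or assayed.

```python
from typing import List, Tuple


def alanine_scan(sequence: str) -> List[Tuple[str, str]]:
    """Return (mutation_label, mutant_sequence) for each single alanine mutant.

    Labels use the usual residue/position/replacement style, e.g. "K2A",
    with 1-based positions. Residues that are already alanine are skipped.
    """
    mutants = []
    for position, residue in enumerate(sequence, start=1):
        if residue == "A":
            continue
        mutant = sequence[:position - 1] + "A" + sequence[position:]
        mutants.append((f"{residue}{position}A", mutant))
    return mutants


if __name__ == "__main__":
    for label, mutant in alanine_scan("MKAL"):   # hypothetical toy peptide
        print(label, mutant)
```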

The present invention further provides fragments of the transporter peptides, in addition
to proteins and peptides that comprise and consist of such fragments, particularly those
comprising the residues identified in Figure 2. The fragments to which the invention pertains,
however, are not to be construed as encompassing fragments that may be disclosed publicly
prior to the present invention.

WO 03/076644 PCT/US03/06667

As used herein, a fragment comprises at least 8, 10, 12, 14, 16, or more contiguous
amino acid residues from a transporter peptide. Such fragments can be chosen based on the
ability to retain one or more of the biological activities of the transporter peptide or could be
chosen for the ability to perform a function, e.g. bind a substrate or act as an immunogen.
Particularly important fragments are biologically active fragments, peptides that are, for
example, about 8 or more amino acids in length. Such fragments will typically comprise a
domain or motif of the transporter peptide, e.g., active site, a transmembrane domain or a
substrate-binding domain. Further, possible fragments include, but are not limited to, domain or
motif containing fragments, soluble peptide fragments, and fragments containing immunogenic
structures. Predicted domains and functional sites are readily identifiable by computer programs
well known and readily available to those of skill in the art (e.g., PROSITE analysis). The
results of one such analysis are provided in Figure 2.
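
The notion of a fragment as a run of contiguous residues can be made concrete with a
sliding window. This is a minimal sketch; the function name contiguous_fragments and the
toy sequence are hypothetical, and the default length of 8 simply mirrors the smallest
fragment length mentioned above.

```python
from typing import List


def contiguous_fragments(sequence: str, length: int = 8) -> List[str]:
    """Return every contiguous fragment of exactly `length` residues."""
    if length < 1:
        raise ValueError("fragment length must be at least 1")
    # A sequence of n residues yields n - length + 1 windows (none if too short).
    return [sequence[i:i + length] for i in range(len(sequence) - length + 1)]


if __name__ == "__main__":
    for fragment in contiguous_fragments("ABCDEFGHIJ", 8):   # toy 10-residue input
        print(fragment)
```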

Polypeptides often contain amino acids other than the 20 amino acids commonly
referred to as the 20 naturally occurring amino acids. Further, many amino acids, including the
terminal amino acids, may be modified by natural processes, such as processing and other
post-translational modifications, or by chemical modification techniques well known in the art.
Common modifications that occur naturally in transporter peptides are described in basic texts,
detailed monographs, and the research literature, and they are well known to those of skill in the
art (some of these features are identified in Figure 2).

Known modifications include, but are not limited to, acetylation, acylation,
ADP-ribosylation, amidation, covalent attachment of flavin, covalent attachment of a heme moiety,
covalent attachment of a nucleotide or nucleotide derivative, covalent attachment of a lipid or
lipid derivative, covalent attachment of phosphatidylinositol, cross-linking, cyclization,
disulfide bond formation, demethylation, formation of covalent crosslinks, formation of cystine,
formation of pyroglutamate, formylation, gamma carboxylation, glycosylation, GPI anchor
formation, hydroxylation, iodination, methylation, myristoylation, oxidation, proteolytic
processing, phosphorylation, prenylation, racemization, selenoylation, sulfation, transfer-RNA
mediated addition of amino acids to proteins such as arginylation, and ubiquitination.

Such modifications are well known to those of skill in the art and have been described in
great detail in the scientific literature. Several particularly common modifications,
glycosylation, lipid attachment, sulfation, gamma-carboxylation of glutamic acid residues,
hydroxylation and ADP-ribosylation, for instance, are described in most basic texts, such as
Proteins - Structure and Molecular Properties, 2nd Ed., T.E. Creighton, W. H. Freeman and
Company, New York (1993). Many detailed reviews are available on this subject, such as by
Wold, F., Posttranslational Covalent Modification of Proteins, B.C. Johnson, Ed., Academic
Press, New York 1-12 (1983); Seifter et al. (Meth. Enzymol. 182:626-646 (1990)) and Rattan et
al. (Ann. N.Y. Acad. Sci. 663:48-62 (1992)).

Accordingly, the transporter peptides of the present invention also encompass
derivatives or analogs in which a substituted amino acid residue is not one encoded by the
genetic code, in which a substituent group is included, in which the mature transporter peptide
is fused with another compound, such as a compound to increase the half-life of the transporter
peptide (for example, polyethylene glycol), or in which the additional amino acids are fused to
the mature transporter peptide, such as a leader or secretory sequence or a sequence for
purification of the mature transporter peptide or a pro-protein sequence.



Protein/Peptide Uses

The proteins of the present invention can be used in substantial and specific assays
related to the functional information provided in the Figures; to raise antibodies or to elicit
another immune response; as a reagent (including the labeled reagent) in assays designed to
quantitatively determine levels of the protein (or its binding partner or ligand) in biological
fluids; and as markers for tissues in which the corresponding protein is preferentially
expressed (either constitutively or at a particular stage of tissue differentiation or
development or in a disease state). Where the protein binds or potentially binds to another
protein or ligand (such as, for example, in a transporter-effector protein interaction or
transporter-ligand interaction), the protein can be used to identify the binding partner/ligand
so as to develop a system to identify inhibitors of the binding interaction. Any or all of these
uses are capable of being developed into reagent grade or kit format for commercialization as
commercial products.

Methods for performing the uses listed above are well known to those skilled in the
art. References disclosing such methods include "Molecular Cloning: A Laboratory Manual",
2d ed., Cold Spring Harbor Laboratory Press, Sambrook, J., E. F. Fritsch and T. Maniatis
eds., 1989, and "Methods in Enzymology: Guide to Molecular Cloning Techniques",
Academic Press, Berger, S. L. and A. R. Kimmel eds., 1987.

The potential uses of the peptides of the present invention are based primarily on the
source of the protein as well as the class/action of the protein. For example, transporters
isolated from humans and their human/mammalian orthologs serve as targets for identifying
agents for use in mammalian therapeutic applications, e.g. a human drug, particularly in
modulating a biological or pathological response in a cell or tissue that expresses the
transporter. Experimental data as provided in Figure 1 indicate that transporter proteins of
the present invention are expressed in the human lung, brain, prostate, ovary, placenta,
thymus, colon, and pancreas. Specifically, the protein is also expressed in tissues such as
small cell carcinoma, liver, neuroblastoma cells, pooled germ cell tumors, adenocarcinoma,
fibrotheoma, and Islets of Langerhans. A large percentage of
pharmaceutical agents are being developed that modulate the activity of transporter proteins,
particularly members of the Na+-independent transporter subfamily (see Background of the
Invention). The structural and functional information provided in the Background and
Figures provides specific and substantial uses for the molecules of the present invention,
particularly in combination with the expression information provided in Figure 1.
Experimental data as provided in Figure 1 indicate expression in humans in organs such as
lung, brain, and prostate, as well as in different tissues. Such uses can readily be
determined using the information provided herein, that known in the art, and routine
experimentation.

The proteins of the present invention (including variants and fragments that may have
been disclosed prior to the present invention) are useful for biological assays related to
transporters that are related to members of the Na+-independent transporter subfamily. Such
assays involve any of the known transporter functions or activities or properties useful for
diagnosis and treatment of transporter-related conditions that are specific for the subfamily of
transporters to which the transporter of the present invention belongs, particularly in cells and
tissues that express the transporter. Experimental data as provided in Figure 1 indicates that
transporter proteins of the present invention are expressed in the human lung, brain, prostate,
ovary, placenta, thymus, colon, and pancreas. Specifically, the protein is also expressed in
tissues such as small cell carcinoma, liver, neuroblastoma cells, pooled germ cell tumors,
adenocarcinoma, fibrotheoma, and Islets of Langerhans. The
proteins of the present invention are also useful in drug screening assays, in cell-based or
cell-free systems (Hodgson, Bio/technology, 1992, Sept 10(9):973-80). Cell-based systems can be
native, i.e., cells that normally express the transporter, as a biopsy or expanded in cell culture.
Experimental data as provided in Figure 1 indicates expression in humans in the organs such as
lung, brain and prostate etc., as well as in different tissues. In an alternate embodiment,
cell-based assays involve recombinant host cells expressing the transporter protein.

The polypeptides can be used to identify compounds that modulate transporter activity
of the protein in its natural state or an altered form that causes a specific disease or pathology
associated with the transporter. Both the transporters of the present invention and appropriate
variants and fragments can be used in high-throughput screens to assay candidate compounds
for the ability to bind to the transporter. These compounds can be further screened against a
functional transporter to determine the effect of the compound on the transporter activity.
Further, these compounds can be tested in animal or invertebrate systems to determine
activity/effectiveness. Compounds can be identified that activate (agonist) or inactivate
(antagonist) the transporter to a desired degree.

Further, the proteins of the present invention can be used to screen a compound for the
ability to stimulate or inhibit interaction between the transporter protein and a molecule that
normally interacts with the transporter protein, e.g. a substrate or a component of the signal
pathway with which the transporter protein normally interacts (for example, another transporter).
Such assays typically include the steps of combining the transporter protein with a candidate
compound under conditions that allow the transporter protein, or fragment, to interact with the
target molecule, and detecting the formation of a complex between the protein and the target or
detecting the biochemical consequence of the interaction between the transporter protein and the
target, such as any of the associated effects of signal transduction, for example changes in
membrane potential, protein phosphorylation, cAMP turnover, and adenylate cyclase activation.

Candidate compounds include, for example, 1) peptides such as soluble peptides,
including Ig-tailed fusion peptides and members of random peptide libraries (see, e.g., Lam et
al., Nature 354:82-84 (1991); Houghten et al., Nature 354:84-86 (1991)) and combinatorial
chemistry-derived molecular libraries made of D- and/or L- configuration amino acids; 2)
phosphopeptides (e.g., members of random and partially degenerate, directed phosphopeptide
libraries, see, e.g., Songyang et al., Cell 72:767-778 (1993)); 3) antibodies (e.g., polyclonal,
monoclonal, humanized, anti-idiotypic, chimeric, and single chain antibodies as well as Fab,
F(ab')2, Fab expression library fragments, and epitope-binding fragments of antibodies); and 4)
small organic and inorganic molecules (e.g., molecules obtained from combinatorial and natural
product libraries).

One candidate compound is a soluble fragment of the receptor that competes for ligand
binding. Other candidate compounds include mutant transporters or appropriate fragments
containing mutations that affect transporter function and thus compete for ligand. Accordingly,
a fragment that competes for ligand, for example with a higher affinity, or a fragment that binds
ligand but does not allow release, is encompassed by the invention.

The invention further includes other end point assays to identify compounds that
modulate (stimulate or inhibit) transporter activity. The assays typically involve an assay of
events in the signal transduction pathway that indicate transporter activity. Thus, the transport
of a ligand, a change in cell membrane potential, activation of a protein, or a change in the
expression of genes that are up- or down-regulated in response to the transporter protein
dependent signal cascade can be assayed.

Any of the biological or biochemical functions mediated by the transporter can be used
as an endpoint assay. These include all of the biochemical or biochemical/biological events
described herein, in the references cited herein, incorporated by reference for these endpoint
assay targets, and other functions known to those of ordinary skill in the art or that can be
readily identified using the information provided in the Figures, particularly Figure 2.
Specifically, a biological function of a cell or tissue that expresses the transporter can be
assayed. Experimental data as provided in Figure 1 indicates that transporter proteins of the
present invention are expressed in the human lung, brain, prostate, ovary, placenta, thymus,
colon, and pancreas. Specifically, the protein is also expressed in tissues such as small cell
carcinoma, liver, neuroblastoma cells, pooled germ cell tumors, adenocarcinoma,
fibrotheoma, and Islets of Langerhans.

Binding and/or activating compounds can also be screened by using chimeric
transporter proteins in which the amino terminal extracellular domain, or parts thereof, the
entire transmembrane domain or subregions, such as any of the seven transmembrane segments
or any of the intracellular or extracellular loops and the carboxy terminal intracellular domain,
or parts thereof, can be replaced by heterologous domains or subregions. For example, a
ligand-binding region can be used that interacts with a different ligand than that which is
recognized by the native transporter. Accordingly, a different set of signal transduction
components is available as an end-point assay for activation. This allows for assays to be
performed in other than the specific host cell from which the transporter is derived.

The proteins of the present invention are also useful in competition binding assays in
methods designed to discover compounds that interact with the transporter (e.g. binding
partners and/or ligands). Thus, a compound is exposed to a transporter polypeptide under
conditions that allow the compound to bind or to otherwise interact with the polypeptide.
Soluble transporter polypeptide is also added to the mixture. If the test compound interacts with
the soluble transporter polypeptide, it decreases the amount of complex formed or activity from
the transporter target. This type of assay is particularly useful in cases in which compounds are
sought that interact with specific regions of the transporter. Thus, the soluble polypeptide that
competes with the target transporter region is designed to contain peptide sequences
corresponding to the region of interest.
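
As a hedged illustration of how the readout of a competition binding assay of this kind might be reduced to numbers, the following sketch computes the percent decrease in complex signal at each concentration of soluble competitor and estimates a half-maximal concentration by log-linear interpolation. The function names, the signal values, the background-subtraction convention, and the interpolation method are assumptions made for illustration only and are not taken from this specification.

```python
import math

def percent_inhibition(signal, uninhibited, background=0.0):
    """Percent decrease in complex signal relative to the uninhibited control."""
    span = uninhibited - background
    if span <= 0:
        raise ValueError("uninhibited control must exceed background")
    return 100.0 * (1.0 - (signal - background) / span)

def estimate_ic50(concentrations, inhibitions):
    """Interpolate, on a log10 concentration axis, where inhibition crosses 50%.

    Concentrations are assumed to be positive. Returns None when the data
    never bracket 50% inhibition.
    """
    points = sorted(zip(concentrations, inhibitions))
    for (c_lo, i_lo), (c_hi, i_hi) in zip(points, points[1:]):
        if i_lo <= 50.0 <= i_hi and i_hi != i_lo:
            frac = (50.0 - i_lo) / (i_hi - i_lo)
            log_c = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10.0 ** log_c
    return None

# Hypothetical readings: complex signal at increasing competitor concentration (nM).
concs = [1.0, 10.0, 100.0, 1000.0]
signals = [950.0, 775.0, 325.0, 150.0]
inhib = [percent_inhibition(s, uninhibited=1000.0, background=100.0) for s in signals]
ic50 = estimate_ic50(concs, inhib)
```

In practice a fitted dose-response model would usually replace the simple interpolation shown here; the sketch only makes the "decrease in the amount of complex formed" concrete.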

To perform cell free drug screening assays, it is sometimes desirable to immobilize
either the transporter protein, or fragment, or its target molecule to facilitate separation of
complexes from uncomplexed forms of one or both of the proteins, as well as to accommodate
automation of the assay.

Techniques for immobilizing proteins on matrices can be used in the drug screening
assays. In one embodiment, a fusion protein can be provided which adds a domain that allows
the protein to be bound to a matrix. For example, glutathione-S-transferase fusion proteins can
be adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or glutathione
derivatized microtitre plates, which are then combined with the cell lysates (e.g., 35S-labeled)
and the candidate compound, and the mixture incubated under conditions conducive to complex
formation (e.g., at physiological conditions for salt and pH). Following incubation, the beads
are washed to remove any unbound label, and the matrix immobilized and radiolabel
determined directly, or in the supernatant after the complexes are dissociated. Alternatively, the
complexes can be dissociated from the matrix, separated by SDS-PAGE, and the level of
transporter-binding protein found in the bead fraction quantitated from the gel using standard
electrophoretic techniques. For example, either the polypeptide or its target molecule can be
immobilized utilizing conjugation of biotin and streptavidin using techniques well known in the
art. Alternatively, antibodies reactive with the protein but which do not interfere with binding
of the protein to its target molecule can be derivatized to the wells of the plate, and the protein
trapped in the wells by antibody conjugation. Preparations of a transporter-binding protein and
a candidate compound are incubated in the transporter protein-presenting wells and the amount
of complex trapped in the well can be quantitated. Methods for detecting such complexes, in
addition to those described above for the GST-immobilized complexes, include
immunodetection of complexes using antibodies reactive with the transporter protein target
molecule, or which are reactive with transporter protein and compete with the target molecule,
as well as enzyme-linked assays which rely on detecting an enzymatic activity associated with
the target molecule.

Agents that modulate one of the transporters of the present invention can be identified
using one or more of the above assays, alone or in combination. It is generally preferable to use
a cell-based or cell free system first and then confirm activity in an animal or other model
system. Such model systems are well known in the art and can readily be employed in this
context.

Modulators of transporter protein activity identified according to these drug screening
assays can be used to treat a subject with a disorder mediated by the transporter pathway, by
treating cells or tissues that express the transporter. Experimental data as provided in Figure 1
indicates expression in humans in the organs such as lung, brain and prostate etc., as well as in
different tissues. These methods of treatment include the steps of administering a modulator of
transporter activity in a pharmaceutical composition to a subject in need of such treatment, the
modulator being identified as described herein.

In yet another aspect of the invention, the transporter proteins can be used as "bait
proteins" in a two-hybrid assay or three-hybrid assay (see, e.g., U.S. Patent No. 5,283,317;
Zervos et al. (1993) Cell 72:223-232; Madura et al. (1993) J. Biol. Chem. 268:12046-12054;
Bartel et al. (1993) Biotechniques 14:920-924; Iwabuchi et al. (1993) Oncogene 8:1693-1696;
and Brent WO94/10300), to identify other proteins, which bind to or interact with the
transporter and are involved in transporter activity. Such transporter-binding proteins are
also likely to be involved in the propagation of signals by the transporter proteins or
transporter targets as, for example, downstream elements of a transporter-mediated signaling
pathway. Alternatively, such transporter-binding proteins are likely to be transporter
inhibitors.

The two-hybrid system is based on the modular nature of most transcription factors,
which consist of separable DNA-binding and activation domains. Briefly, the assay utilizes
two different DNA constructs. In one construct, the gene that codes for a transporter protein
is fused to a gene encoding the DNA binding domain of a known transcription factor (e.g.,
GAL-4). In the other construct, a DNA sequence, from a library of DNA sequences, that
encodes an unidentified protein ("prey" or "sample") is fused to a gene that codes for the
activation domain of the known transcription factor. If the "bait" and the "prey" proteins are
able to interact, in vivo, forming a transporter-dependent complex, the DNA-binding and
activation domains of the transcription factor are brought into close proximity. This
proximity allows transcription of a reporter gene (e.g., LacZ) which is operably linked to a
transcriptional regulatory site responsive to the transcription factor. Expression of the
reporter gene can be detected and cell colonies containing the functional transcription factor
can be isolated and used to obtain the cloned gene which encodes the protein which interacts
with the transporter protein.
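
The selection logic of the two-hybrid screen described above can be summarized in a toy model: the reporter is scored as expressed only when a fusion carrying the DNA-binding domain and a fusion carrying the activation domain are held together by an interaction between their fused partners. The sketch below is illustrative only; the class and function names, the protein names, and the interaction table are hypothetical placeholders rather than data from this specification.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Fusion:
    domain: str    # "DBD" (DNA-binding domain) or "AD" (activation domain)
    partner: str   # name of the protein fused to that domain

def reporter_on(bait, prey, interactions):
    """Reporter is transcribed only if a DBD fusion and an AD fusion interact."""
    if bait.domain != "DBD" or prey.domain != "AD":
        return False
    return frozenset((bait.partner, prey.partner)) in interactions

def screen_library(bait, library, interactions):
    """Return prey names whose colonies would score positive for the reporter."""
    return [p.partner for p in library if reporter_on(bait, p, interactions)]

# Hypothetical data: protein names and the interaction table are placeholders.
known = {frozenset(("transporter", "preyB"))}
bait = Fusion("DBD", "transporter")
library = [Fusion("AD", name) for name in ("preyA", "preyB", "preyC")]
hits = screen_library(bait, library, known)
```

In a real screen the interaction table is of course unknown; it is what the reporter-positive colonies reveal.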

This invention further pertains to novel agents identified by the above-described
screening assays. Accordingly, it is within the scope of this invention to further use an agent
identified as described herein in an appropriate animal model. For example, an agent
identified as described herein (e.g., a transporter-modulating agent, an antisense transporter
nucleic acid molecule, a transporter-specific antibody, or a transporter-binding partner) can
be used in an animal or other model to determine the efficacy, toxicity, or side effects of
treatment with such an agent. Alternatively, an agent identified as described herein can be
used in an animal or other model to determine the mechanism of action of such an agent.
Furthermore, this invention pertains to uses of novel agents identified by the above-described
screening assays for treatments as described herein.

The transporter proteins of the present invention are also useful to provide a target for
diagnosing a disease or predisposition to disease mediated by the peptide. Accordingly, the
invention provides methods for detecting the presence, or levels of, the protein (or encoding
mRNA) in a cell, tissue, or organism. Experimental data as provided in Figure 1 indicates
expression in humans in the organs such as lung, brain and prostate etc., as well as in different
tissues. The method involves contacting a biological sample with a compound capable of
interacting with the transporter protein such that the interaction can be detected. Such an assay
can be provided in a single detection format or a multi-detection format such as an antibody
chip array.

One agent for detecting a protein in a sample is an antibody capable of selectively
binding to the protein. A biological sample includes tissues, cells and biological fluids isolated
from a subject, as well as tissues, cells and fluids present within a subject.

The peptides of the present invention also provide targets for diagnosing active protein
activity, disease, or predisposition to disease, in a patient having a variant peptide, particularly
activities and conditions that are known for other members of the family of proteins to which
the present one belongs. Thus, the peptide can be isolated from a biological sample and assayed
for the presence of a genetic mutation that results in aberrant peptide. This includes amino acid
substitution, deletion, insertion, rearrangement (as the result of aberrant splicing events), and
inappropriate post-translational modification. Analytic methods include altered electrophoretic
mobility, altered tryptic peptide digest, altered transporter activity in cell-based or cell-free
assay, alteration in ligand or antibody-binding pattern, altered isoelectric point, direct amino
acid sequencing, and any other of the known assay techniques useful for detecting mutations in
a protein. Such an assay can be provided in a single detection format or a multi-detection format
such as an antibody chip array.

In vitro techniques for detection of peptide include enzyme linked immunosorbent
assays (ELISAs), Western blots, immunoprecipitations and immunofluorescence using a
detection reagent, such as an antibody or protein binding agent. Alternatively, the peptide can
be detected in vivo in a subject by introducing into the subject a labeled anti-peptide antibody or
other types of detection agent. For example, the antibody can be labeled with a radioactive
marker whose presence and location in a subject can be detected by standard imaging
techniques. Particularly useful are methods that detect the allelic variant of a peptide expressed
in a subject and methods which detect fragments of a peptide in a sample.

The peptides are also useful in pharmacogenomic analysis. Pharmacogenomics deals
with clinically significant hereditary variations in the response to drugs due to altered drug
disposition and abnormal action in affected persons. See, e.g., Eichelbaum, M. (Clin. Exp.
Pharmacol. Physiol. 23(10-11):983-985 (1996)), and Linder, M.W. (Clin. Chem. 43(2):254-266
(1997)). The clinical outcomes of these variations result in severe toxicity of therapeutic drugs
in certain individuals or therapeutic failure of drugs in certain individuals as a result of
individual variation in metabolism. Thus, the genotype of the individual can determine the way
a therapeutic compound acts on the body or the way the body metabolizes the compound.
Further, the activity of drug metabolizing enzymes affects both the intensity and duration of
drug action. Thus, the pharmacogenomics of the individual permit the selection of effective
compounds and effective dosages of such compounds for prophylactic or therapeutic treatment
based on the individual's genotype. The discovery of genetic polymorphisms in some drug
metabolizing enzymes has explained why some patients do not obtain the expected drug effects,
show an exaggerated drug effect, or experience serious toxicity from standard drug dosages.
Polymorphisms can be expressed in the phenotype of the extensive metabolizer and the
phenotype of the poor metabolizer. Accordingly, genetic polymorphism may lead to allelic
protein variants of the transporter protein in which one or more of the transporter functions in
one population is different from those in another population. The peptides thus provide a target
for ascertaining a genetic predisposition that can affect treatment modality. Thus, in a ligand-based
treatment, polymorphism may give rise to amino terminal extracellular domains and/or other
ligand-binding regions that are more or less active in ligand binding, and transporter activation.
Accordingly, ligand dosage would necessarily be modified to maximize the therapeutic effect
within a given population containing a polymorphism. As an alternative to genotyping, specific
polymorphic peptides could be identified.

The peptides are also useful for treating a disorder characterized by an absence of,
inappropriate, or unwanted expression of the protein. Experimental data as provided in Figure 1
indicates expression in humans in the organs such as lung, brain and prostate etc., as well as in
different tissues. Accordingly, methods for treatment include the use of the transporter protein
or fragments.

Antibodies 

The invention also provides antibodies that selectively bind to one of the peptides of the
present invention, a protein comprising such a peptide, as well as variants and fragments
thereof. As used herein, an antibody selectively binds a target peptide when it binds the target
peptide and does not significantly bind to unrelated proteins. An antibody is still considered to
selectively bind a peptide even if it also binds to other proteins that are not substantially
homologous with the target peptide so long as such proteins share homology with a fragment or
domain of the peptide target of the antibody. In this case, it would be understood that antibody
binding to the peptide is still selective despite some degree of cross-reactivity.

As used herein, an antibody is defined in terms consistent with that recognized within
the art: they are multi-subunit proteins produced by a mammalian organism in response to an
antigen challenge. The antibodies of the present invention include polyclonal antibodies and
monoclonal antibodies, as well as fragments of such antibodies, including, but not limited to,
Fab or F(ab')2, and Fv fragments.

Many methods are known for generating and/or identifying antibodies to a given target 
peptide. Several such methods are described by Harlow, Antibodies, Cold Spring Harbor Press, 
(1989). 

In general, to generate antibodies, an isolated peptide is used as an immunogen and is
administered to a mammalian organism, such as a rat, rabbit or mouse. The full-length protein,
an antigenic peptide fragment or a fusion protein can be used. Particularly important fragments
are those covering functional domains, such as the domains identified in Figure 2, and domains
of sequence homology or divergence amongst the family, such as those that can readily be
identified using protein alignment methods and as presented in the Figures.

WO 03/076644 PCT/US03/06667

Antibodies are preferably prepared from regions or discrete fragments of the
transporter proteins. Antibodies can be prepared from any region of the peptide as described
herein. However, preferred regions will include those involved in function/activity and/or
transporter/binding partner interaction. Figure 2 can be used to identify particularly
important regions while sequence alignment can be used to identify conserved and unique
sequence fragments.
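
One physical property that could be used to shortlist candidate fragments is hydrophilicity, taken as a rough proxy for exposure on the protein surface. The sketch below ranks contiguous fragments of a chosen length by mean hydropathy, lowest (most hydrophilic) first. It uses the widely used Kyte-Doolittle hydropathy scale, which is an assumption of this example (no particular scale is named in this specification), and the input sequence is a made-up toy string, not the sequence of Figure 2.

```python
# Kyte-Doolittle hydropathy values (more negative = more hydrophilic).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

def hydrophilic_windows(seq, length=10, top=3):
    """Rank contiguous fragments of `length` residues by mean hydropathy.

    Returns up to `top` tuples of (start_index, fragment, mean_hydropathy),
    most hydrophilic first. Only the 20 standard residues are supported.
    """
    seq = seq.upper()
    if length > len(seq):
        return []
    scored = []
    for start in range(len(seq) - length + 1):
        frag = seq[start:start + length]
        mean = sum(KD[aa] for aa in frag) / length
        scored.append((mean, start, frag))
    scored.sort()
    return [(start, frag, round(mean, 2)) for mean, start, frag in scored[:top]]

# Toy sequence (hypothetical): hydrophobic ends flanking a charged stretch.
toy = "MLLIVAVLLGKDEERKRDSNQAVILLFFGG"
best = hydrophilic_windows(toy, length=10, top=1)
```

A ranking like this is only a starting point; uniqueness of the fragment relative to related family members would still need to be checked by sequence alignment.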



An antigenic fragment will typically comprise at least 8 contiguous amino acid residues.
The antigenic peptide can comprise, however, at least 10, 12, 14, 16 or more amino acid
residues. Such fragments can be selected on a physical property, such as fragments that correspond
to regions that are located on the surface of the protein, e.g., hydrophilic regions, or can be
selected based on sequence uniqueness (see Figure 2).

Detection of an antibody of the present invention can be facilitated by coupling (i.e.,
physically linking) the antibody to a detectable substance. Examples of detectable substances
include various enzymes, prosthetic groups, fluorescent materials, luminescent materials,
bioluminescent materials, and radioactive materials. Examples of suitable enzymes include
horseradish peroxidase, alkaline phosphatase, β-galactosidase, or acetylcholinesterase;
examples of suitable prosthetic group complexes include streptavidin/biotin and avidin/biotin;
examples of suitable fluorescent materials include umbelliferone, fluorescein, fluorescein
isothiocyanate, rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or
phycoerythrin; an example of a luminescent material includes luminol; examples of
bioluminescent materials include luciferase, luciferin, and aequorin, and examples of suitable
radioactive material include 125I, 131I, 35S or 3H.

Antibody Uses

The antibodies can be used to isolate one of the proteins of the present invention by
standard techniques, such as affinity chromatography or immunoprecipitation. The antibodies
can facilitate the purification of the natural protein from cells and recombinantly produced
protein expressed in host cells. In addition, such antibodies are useful to detect the presence of
one of the proteins of the present invention in cells or tissues to determine the pattern of
expression of the protein among various tissues in an organism and over the course of normal
development. Experimental data as provided in Figure 1 indicates that transporter proteins of
the present invention are expressed in the human lung, brain, prostate, ovary, placenta,
thymus, colon, and pancreas. Specifically, the protein is also expressed in tissues such as
small cell carcinoma, liver, neuroblastoma cells, pooled germ cell tumors, adenocarcinoma,
fibrotheoma, and Islets of Langerhans. Further, such antibodies can
be used to detect protein in situ, in vitro, or in a cell lysate or supernatant in order to evaluate the
abundance and pattern of expression. Also, such antibodies can be used to assess abnormal
tissue distribution or abnormal expression during development or progression of a biological
condition. Antibody detection of circulating fragments of the full length protein can be used to
identify turnover.

Further, the antibodies can be used to assess expression in disease states such as in
active stages of the disease or in an individual with a predisposition toward disease related to
the protein's function. When a disorder is caused by an inappropriate tissue distribution,
developmental expression, level of expression of the protein, or expressed/processed form, the
antibody can be prepared against the normal protein. Experimental data as provided in Figure 1
indicates expression in humans in the organs such as lung, brain and prostate etc., as well as in
different tissues. If a disorder is characterized by a specific mutation in the protein, antibodies
specific for this mutant protein can be used to assay for the presence of the specific mutant
protein.



The antibodies can also be used to assess normal and aberrant subcellular localization of
cells in the various tissues in an organism. Experimental data as provided in Figure 1 indicates
expression in humans in the organs such as lung, brain and prostate etc., as well as in different
tissues. The diagnostic uses can be applied, not only in genetic testing, but also in monitoring a
treatment modality. Accordingly, where treatment is ultimately aimed at correcting expression
level or the presence of aberrant sequence and aberrant tissue distribution or developmental
expression, antibodies directed against the protein or relevant fragments can be used to monitor
therapeutic efficacy.

Additionally, antibodies are useful in pharmacogenomic analysis. Thus, antibodies
prepared against polymorphic proteins can be used to identify individuals that require modified
treatment modalities. The antibodies are also useful as diagnostic tools as an immunological
marker for aberrant protein analyzed by electrophoretic mobility, isoelectric point, tryptic
peptide digest, and other physical assays known to those in the art.

The antibodies are also useful for tissue typing. Experimental data as provided in
Figure 1 indicates expression in humans in the organs such as lung, brain and prostate etc., as
well as in different tissues. Thus, where a specific protein has been correlated with expression
in a specific tissue, antibodies that are specific for this protein can be used to identify a tissue
type.

The antibodies are also useful for inhibiting protein function, for example, blocking the
binding of the transporter peptide to a binding partner such as a ligand or protein binding
partner. These uses can also be applied in a therapeutic context in which treatment involves
inhibiting the protein's function. An antibody can be used, for example, to block binding, thus
modulating (agonizing or antagonizing) the peptide's activity. Antibodies can be prepared
against specific fragments containing sites required for function or against intact protein that is
associated with a cell or cell membrane. See Figure 2 for structural information relating to the
proteins of the present invention.

The invention also encompasses kits for using antibodies to detect the presence of a
protein in a biological sample. The kit can comprise antibodies such as a labeled or labelable
antibody and a compound or agent for detecting protein in a biological sample; means for
determining the amount of protein in the sample; means for comparing the amount of protein in
the sample with a standard; and instructions for use. Such a kit can be supplied to detect a single
protein or epitope or can be configured to detect one of a multitude of epitopes, such as in an
antibody detection array. Arrays are described in detail below for nucleic acid arrays and
similar methods have been developed for antibody arrays.
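
As a hedged sketch of what "comparing the amount of protein in the sample with a standard" might look like computationally, the example below estimates the amount of protein in a sample by linear interpolation between readings obtained for known standards. The function name, the readings, and the choice of piecewise-linear interpolation are assumptions for illustration; this specification does not prescribe a particular quantitation method.

```python
def interpolate_amount(reading, standards):
    """Estimate analyte amount from a reading by linear interpolation.

    `standards` is a list of (known_amount, reading) pairs; readings are
    assumed to increase with amount. Returns None if the reading falls
    outside the range covered by the standards.
    """
    pts = sorted(standards, key=lambda p: p[1])
    for (a_lo, r_lo), (a_hi, r_hi) in zip(pts, pts[1:]):
        if r_lo <= reading <= r_hi and r_hi != r_lo:
            return a_lo + (reading - r_lo) * (a_hi - a_lo) / (r_hi - r_lo)
    return None

# Hypothetical standards: (amount in ng, detector reading in arbitrary units).
standards = [(0.0, 0.05), (10.0, 0.25), (50.0, 1.05), (100.0, 2.05)]
sample_amount = interpolate_amount(0.65, standards)
```

Readings outside the standard range return None rather than an extrapolated value, which mirrors the usual practice of diluting and re-measuring such samples.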



Nucleic Acid Molecules 

The present invention further provides isolated nucleic acid molecules that encode a
transporter peptide or protein of the present invention (cDNA, transcript and genomic
sequence). Such nucleic acid molecules will consist of, consist essentially of, or comprise a
nucleotide sequence that encodes one of the transporter peptides of the present invention, an
allelic variant thereof, or an ortholog or paralog thereof.

As used herein, an "isolated" nucleic acid molecule is one that is separated from other 
nucleic acid present in the natural source of the nucleic acid. Preferably, an "isolated" nucleic 
acid is free of sequences that naturally flank the nucleic acid (i.e., sequences located at the 5* 

30 and 3* ends of the nucleic acid) in the genomic DNA of the organism from which the nucleic 
acid is derived. However, there can be some flanking nucleotide sequences, for example up to 
about 5KB, 4KB, 3KB, 2KB, or 1KB or less, particularly contiguous peptide encoding 
sequences and peptide encoding sequences within the same gene but separated by introns in the 
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genomic sequence. The important point is that the nucleic acid is isolated from remote and 
unimportant flanking sequences such that it can be subjected to the specific manipulations 
described herein such as recombinant expression, preparation of probes and primers, and other 
uses specific to the nucleic acid sequences. 

Moreover, an "isolated" nucleic acid molecule, such as a transcript/cDNA molecule, can
be substantially free of other cellular material, or culture medium when produced by
recombinant techniques, or chemical precursors or other chemicals when chemically 
synthesized. However, the nucleic acid molecule can be fused to other coding or regulatory 
sequences and still be considered isolated. 

For example, recombinant DNA molecules contained in a vector are considered
isolated. Further examples of isolated DNA molecules include recombinant DNA molecules
maintained in heterologous host cells or purified (partially or substantially) DNA molecules in
solution. Isolated RNA molecules include in vivo or in vitro RNA transcripts of the isolated
DNA molecules of the present invention. Isolated nucleic acid molecules according to the
present invention further include such molecules produced synthetically.

Accordingly, the present invention provides nucleic acid molecules that consist of the 
nucleotide sequence shown in Figure 1 or 3 (SEQ ID NO:1, transcript sequence and SEQ ID
NO:3, genomic sequence), or any nucleic acid molecule that encodes the protein provided in
Figure 2, SEQ ID NO:2. A nucleic acid molecule consists of a nucleotide sequence when the
nucleotide sequence is the complete nucleotide sequence of the nucleic acid molecule.

The present invention further provides nucleic acid molecules that consist essentially of 
the nucleotide sequence shown in Figure 1 or 3 (SEQ ID NO:1, transcript sequence and SEQ ID
NO:3, genomic sequence), or any nucleic acid molecule that encodes the protein provided in
Figure 2, SEQ ID NO:2. A nucleic acid molecule consists essentially of a nucleotide sequence
when such a nucleotide sequence is present with only a few additional nucleic acid residues in
the final nucleic acid molecule. 

The present invention further provides nucleic acid molecules that comprise the 
nucleotide sequences shown in Figure 1 or 3 (SEQ ID NO:1, transcript sequence and SEQ ID
NO:3, genomic sequence), or any nucleic acid molecule that encodes the protein provided in
Figure 2, SEQ ID NO:2. A nucleic acid molecule comprises a nucleotide sequence when the
nucleotide sequence is at least part of the final nucleotide sequence of the nucleic acid molecule.
In such a fashion, the nucleic acid molecule can be only the nucleotide sequence or have
additional nucleic acid residues, such as nucleic acid residues that are naturally associated with
it or heterologous nucleotide sequences. Such a nucleic acid molecule can have a few
additional nucleotides or can comprise several hundred or more additional nucleotides. A brief 
description of how various types of these nucleic acid molecules can be readily made/isolated is 
provided below. 

In Figures 1 and 3, both coding and non-coding sequences are provided. Because of
the source of the present invention, human genomic sequence (Figure 3) and
cDNA/transcript sequences (Figure 1), the nucleic acid molecules in the Figures will contain
genomic intronic sequences, 5' and 3' non-coding sequences, gene regulatory regions and
non-coding intergenic sequences. In general such sequence features are either noted in
Figures 1 and 3 or can readily be identified using computational tools known in the art. As
discussed below, some of the non-coding regions, particularly gene regulatory elements such 
as promoters, are useful for a variety of purposes, e.g. control of heterologous gene 
expression, target for identifying gene activity modulating compounds, and are particularly 
claimed as fragments of the genomic sequence provided herein. 

The isolated nucleic acid molecules can encode the mature protein plus additional
amino or carboxyl-terminal amino acids, or amino acids interior to the mature peptide (when
the mature form has more than one peptide chain, for instance). Such sequences may play a
role in processing of a protein from precursor to a mature form, facilitate protein trafficking,
prolong or shorten protein half-life or facilitate manipulation of a protein for assay or
production, among other things. As generally is the case in situ, the additional amino acids may
be processed away from the mature protein by cellular enzymes. 

As mentioned above, the isolated nucleic acid molecules include, but are not limited to, 
the sequence encoding the transporter peptide alone, the sequence encoding the mature peptide 
and additional coding sequences, such as a leader or secretory sequence (e.g., a pre-pro or
pro-protein sequence), the sequence encoding the mature peptide, with or without the additional
coding sequences, plus additional non-coding sequences, for example introns and non-coding 5'
and 3' sequences such as transcribed but non-translated sequences that play a role in
transcription, mRNA processing (including splicing and polyadenylation signals), ribosome
binding and stability of mRNA. In addition, the nucleic acid molecule may be fused to a
marker sequence encoding, for example, a peptide that facilitates purification.

Isolated nucleic acid molecules can be in the form of RNA, such as mRNA, or in the
form of DNA, including cDNA and genomic DNA obtained by cloning or produced by chemical
synthetic techniques or by a combination thereof. The nucleic acid, especially DNA, can be
double-stranded or single-stranded. Single-stranded nucleic acid can be the coding strand
(sense strand) or the non-coding strand (anti-sense strand). 

The invention further provides nucleic acid molecules that encode fragments of the 
peptides of the present invention as well as nucleic acid molecules that encode obvious variants 
of the transporter proteins of the present invention that are described above. Such nucleic acid
molecules may be naturally occurring, such as allelic variants (same locus), paralogs (different 
locus), and orthologs (different organism), or may be constructed by recombinant DNA 
methods or by chemical synthesis. Such non-naturally occurring variants may be made by 
mutagenesis techniques, including those applied to nucleic acid molecules, cells, or organisms. 
Accordingly, as discussed above, the variants can contain nucleotide substitutions, deletions,
inversions and insertions. Variation can occur in either or both the coding and non-coding 
regions. The variations can produce both conservative and non-conservative amino acid 
substitutions. 



The present invention further provides non-coding fragments of the nucleic acid
molecules provided in Figures 1 and 3. Preferred non-coding fragments include, but are not
limited to, promoter sequences, enhancer sequences, gene modulating sequences and gene
termination sequences. Such fragments are useful in controlling heterologous gene expression
and in developing screens to identify gene-modulating agents. A promoter can readily be
identified as being 5' to the ATG start site in the genomic sequence provided in Figure 3.

A fragment comprises a contiguous nucleotide sequence greater than 12 or more
nucleotides. Further, a fragment could be at least 30, 40, 50, 100, 250 or 500 nucleotides in length.
The length of the fragment will be based on its intended use. For example, the fragment can
encode epitope bearing regions of the peptide, or can be useful as DNA probes and primers.
Such fragments can be isolated using the known nucleotide sequence to synthesize an
oligonucleotide probe. A labeled probe can then be used to screen a cDNA library, genomic
DNA library, or mRNA to isolate nucleic acid corresponding to the coding region. Further,
primers can be used in PCR reactions to clone specific regions of a gene.

A probe/primer typically comprises a substantially purified oligonucleotide or
oligonucleotide pair. The oligonucleotide typically comprises a region of nucleotide sequence
that hybridizes under stringent conditions to at least about 12, 20, 25, 40, 50 or more
consecutive nucleotides.

Orthologs, homologs, and allelic variants can be identified using methods well known in
the art. As described in the Peptide Section, these variants comprise a nucleotide sequence
encoding a peptide that is typically 60-70%, 70-80%, 80-90%, and more typically at least about
90-95% or more homologous to the nucleotide sequence shown in the Figure sheets or a 
fragment of this sequence. Such nucleic acid molecules can readily be identified as being able 
to hybridize under moderate to stringent conditions, to the nucleotide sequence shown in the 
Figure sheets or a fragment of the sequence. Allelic variants can readily be determined by 
genetic locus of the encoding gene. The gene encoding the novel transporter protein of the 
present invention is located on a genome component that has been mapped to human 
chromosome 6 (as indicated in Figure 3), which is supported by multiple lines of evidence, 
such as STS and BAC map data. 

Figure 3 provides information on SNPs that have been identified in a gene encoding 
the transporter protein of the present invention. 94 SNP variants were found, including 10
indels (indicated by a "-") and 1 SNP in exons. SNPs, identified at different nucleotide
positions in introns and regions 5' and 3' of the ORF, may affect control/regulatory
elements. 

As used herein, the term "hybridizes under stringent conditions" is intended to describe
conditions for hybridization and washing under which nucleotide sequences encoding a peptide
at least 60-70% homologous to each other typically remain hybridized to each other. The
conditions can be such that sequences at least about 60%, at least about 70%, or at least about
80% or more homologous to each other typically remain hybridized to each other. Such
stringent conditions are known to those skilled in the art and can be found in Current Protocols
in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. One example of stringent
hybridization conditions is hybridization in 6X sodium chloride/sodium citrate (SSC) at about
45°C, followed by one or more washes in 0.2X SSC, 0.1% SDS at 50-65°C. Examples of
moderate to low stringency hybridization conditions are well known in the art. 

Nucleic Acid Molecule Uses 

The nucleic acid molecules of the present invention are useful for probes, primers, 
chemical intermediates, and in biological assays. The nucleic acid molecules are useful as a 
hybridization probe for messenger RNA, transcript/cDNA and genomic DNA to isolate full- 
length cDNA and genomic clones encoding the peptide described in Figure 2 and to isolate 
cDNA and genomic clones that correspond to variants (alleles, orthologs, etc.) producing the 
same or related peptides shown in Figure 2. 94 SNPs, including 10 indels, have been identified
in the gene encoding the transporter protein provided by the present invention and are given in
Figure 3. 

The probe can correspond to any sequence along the entire length of the nucleic acid 
molecules provided in the Figures. Accordingly, it could be derived from 5' noncoding regions, 
the coding region, and 3' noncoding regions. However, as discussed, fragments are not to be
construed as encompassing fragments disclosed prior to the present invention. 

The nucleic acid molecules are also useful as primers for PCR to amplify any given 
region of a nucleic acid molecule and are useful to synthesize antisense molecules of desired 
length and sequence. 

The nucleic acid molecules are also useful for constructing recombinant vectors. Such
vectors include expression vectors that express a portion of, or all of, the peptide sequences.
Vectors also include insertion vectors, used to integrate into another nucleic acid molecule
sequence, such as into the cellular genome, to alter in situ expression of a gene and/or gene
product. For example, an endogenous coding sequence can be replaced via homologous
recombination with all or part of the coding region containing one or more specifically
introduced mutations. 

The nucleic acid molecules are also useful for expressing antigenic portions of the 
proteins. 



The nucleic acid molecules are also useful as probes for determining the chromosomal
positions of the nucleic acid molecules by means of in situ hybridization methods. The gene
encoding the novel transporter protein of the present invention is located on a genome
component that has been mapped to human chromosome 6 (as indicated in Figure 3), which is
supported by multiple lines of evidence, such as STS and BAC map data.

The nucleic acid molecules are also useful in making vectors containing the gene
regulatory regions of the nucleic acid molecules of the present invention.

The nucleic acid molecules are also useful for designing ribozymes corresponding to all,
or a part, of the mRNA produced from the nucleic acid molecules described herein.

The nucleic acid molecules are also useful for making vectors that express part, or all, of
the peptides.

The nucleic acid molecules are also useful for constructing host cells expressing a part,
or all, of the nucleic acid molecules and peptides.

The nucleic acid molecules are also useful for constructing transgenic animals
expressing all, or a part, of the nucleic acid molecules and peptides.

The nucleic acid molecules are also useful as hybridization probes for determining the
presence, level, form and distribution of nucleic acid expression. Experimental data as 
provided in Figure 1 indicates that transporter proteins of the present invention are expressed 
in the human lung, brain, prostate, ovary, placenta, thymus, colon, and pancreas. 
Specifically, the protein is also expressed in tissues such as small cell carcinoma, liver,
neuroblastoma cells, pooled germ cell tumors, adenocarcinoma, fibrotheoma, and Islets of
Langerhans.

Accordingly, the probes can be used to detect the presence of, or to determine levels of, 
a specific nucleic acid molecule in cells, tissues, and in organisms. The nucleic acid whose 
level is determined can be DNA or RNA. Accordingly, probes corresponding to the peptides
described herein can be used to assess expression and/or gene copy number in a given cell, 
tissue, or organism. These uses are relevant for diagnosis of disorders involving an increase or 
decrease in transporter protein expression relative to normal results. 

In vitro techniques for detection of mRNA include Northern hybridizations and in situ 
hybridizations. In vitro techniques for detecting DNA include Southern hybridizations and in
situ hybridization. 

Probes can be used as a part of a diagnostic test kit for identifying cells or tissues that 
express a transporter protein, such as by measuring a level of a transporter-encoding nucleic 
acid in a sample of cells from a subject, e.g., mRNA or genomic DNA, or determining if a
transporter gene has been mutated. Experimental data as provided in Figure 1 indicates that
transporter proteins of the present invention are expressed in the human lung, brain, prostate,
ovary, placenta, thymus, colon, and pancreas. Specifically, the protein is also expressed in
tissues such as small cell carcinoma, liver, neuroblastoma cells, pooled germ cell tumors,
adenocarcinoma, fibrotheoma, and Islets of Langerhans.

Nucleic acid expression assays are useful for drug screening to identify compounds that
modulate transporter nucleic acid expression.

The invention thus provides a method for identifying a compound that can be used to 
treat a disorder associated with nucleic acid expression of the transporter gene, particularly 
biological and pathological processes that are mediated by the transporter in cells and tissues
that express it. Experimental data as provided in Figure 1 indicates expression in humans in
organs such as lung, brain and prostate, as well as in different tissues. The method
typically includes assaying the ability of the compound to modulate the expression of the
transporter nucleic acid and thus identifying a compound that can be used to treat a disorder
characterized by undesired transporter nucleic acid expression. The assays can be performed in
cell-based and cell-free systems. Cell-based assays include cells naturally expressing the 
transporter nucleic acid or recombinant cells genetically engineered to express specific nucleic 
acid sequences. 

The assay for transporter nucleic acid expression can involve direct assay of nucleic
acid levels, such as mRNA levels, or assay of collateral compounds involved in the signal pathway.
Further, the expression of genes that are up- or down-regulated in response to the transporter 
protein signal pathway can also be assayed. In this embodiment the regulatory regions of these 
genes can be operably linked to a reporter gene such as luciferase. 

Thus, modulators of transporter gene expression can be identified in a method wherein a
cell is contacted with a candidate compound and the expression of mRNA determined. The
level of expression of transporter mRNA in the presence of the candidate compound is
compared to the level of expression of transporter mRNA in the absence of the candidate
compound. The candidate compound can then be identified as a modulator of nucleic acid
expression based on this comparison and be used, for example, to treat a disorder characterized
by aberrant nucleic acid expression. When expression of mRNA is statistically significantly
greater in the presence of the candidate compound than in its absence, the candidate compound
is identified as a stimulator of nucleic acid expression. When nucleic acid expression is
statistically significantly less in the presence of the candidate compound than in its absence, the
candidate compound is identified as an inhibitor of nucleic acid expression.
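
The comparison described above can be illustrated with a minimal Python sketch. It assumes replicate mRNA measurements are available for treated and untreated cells and uses a two-sided permutation test as the significance test; the text does not specify a particular test, and the function name, the 0.05 threshold, and the example values are hypothetical.

```python
import random
from statistics import mean


def classify_modulator(treated, untreated, alpha=0.05, n_perm=2000, seed=0):
    """Label a candidate compound from replicate mRNA levels (hypothetical helper).

    treated / untreated: expression levels measured in the presence and in the
    absence of the candidate compound. The permutation test and the alpha
    threshold are illustrative assumptions, not part of the original text.
    """
    observed = mean(treated) - mean(untreated)
    pooled = list(treated) + list(untreated)
    n_treated = len(treated)
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible

    extreme = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        diff = mean(pooled[:n_treated]) - mean(pooled[n_treated:])
        if abs(diff) >= abs(observed):
            extreme += 1

    # Add-one correction keeps the estimated p-value strictly above zero.
    p_value = (extreme + 1) / (n_perm + 1)
    if p_value >= alpha:
        return "no significant effect"
    return "stimulator" if observed > 0 else "inhibitor"


# Made-up replicate values, for illustration only.
with_compound = [10.1, 9.8, 10.3, 10.0, 9.9]
without_compound = [5.0, 5.2, 4.9, 5.1, 5.0]
print(classify_modulator(with_compound, without_compound))
```

In practice the choice of test, the number of replicates, and the normalization of mRNA levels would depend on the assay used.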

The invention further provides methods of treatment, with the nucleic acid as a target, 
using a compound identified through drug screening as a gene modulator to modulate 
transporter nucleic acid expression in cells and tissues that express the transporter. 
Experimental data as provided in Figure 1 indicates that transporter proteins of the present
invention are expressed in the human lung, brain, prostate, ovary, placenta, thymus, colon,
and pancreas. Specifically, the protein is also expressed in tissues such as small cell
carcinoma, liver, neuroblastoma cells, pooled germ cell tumors, adenocarcinoma,
fibrotheoma, and Islets of Langerhans. Modulation includes both
up-regulation (i.e., activation or agonization) and down-regulation (i.e., suppression or
antagonization) of nucleic acid expression.

Alternatively, a modulator for transporter nucleic acid expression can be a small 
molecule or drug identified using the screening assays described herein as long as the drug or 
small molecule inhibits the transporter nucleic acid expression in the cells and tissues that
express the protein. Experimental data as provided in Figure 1 indicates expression in humans
in organs such as lung, brain and prostate, as well as in different tissues.

The nucleic acid molecules are also useful for monitoring the effectiveness of
modulating compounds on the expression or activity of the transporter gene in clinical trials or
in a treatment regimen. Thus, the gene expression pattern can serve as a barometer for the
continuing effectiveness of treatment with the compound, particularly with compounds to
which a patient can develop resistance. The gene expression pattern can also serve as a marker
indicative of a physiological response of the affected cells to the compound. Accordingly, such
monitoring would allow either increased administration of the compound or the administration
of alternative compounds to which the patient has not become resistant. Similarly, if the level
of nucleic acid expression falls below a desirable level, administration of the compound could
be commensurately decreased.

The nucleic acid molecules are also useful in diagnostic assays for qualitative changes
in transporter nucleic acid expression, and particularly in qualitative changes that lead to
pathology. The nucleic acid molecules can be used to detect mutations in transporter genes and
gene expression products such as mRNA. The nucleic acid molecules can be used as
hybridization probes to detect naturally occurring genetic mutations in the transporter gene and
thereby to determine whether a subject with the mutation is at risk for a disorder caused by the
mutation. Mutations include deletion, addition, or substitution of one or more nucleotides in the
gene, chromosomal rearrangement, such as inversion or transposition, modification of genomic
DNA, such as aberrant methylation patterns or changes in gene copy number, such as
amplification. Detection of a mutated form of the transporter gene associated with a
dysfunction provides a diagnostic tool for an active disease or susceptibility to disease when the
disease results from overexpression, underexpression, or altered expression of a transporter
protein.

Individuals carrying mutations in the transporter gene can be detected at the nucleic acid
level by a variety of techniques. Figure 3 provides information on SNPs that have been
identified in a gene encoding the transporter protein of the present invention. 94 SNP
variants were found, including 10 indels (indicated by a "-") and 1 SNP in exons. SNPs,
identified at different nucleotide positions in introns and regions 5' and 3' of the ORF, may
affect control/regulatory elements. The gene encoding the novel transporter protein of the
present invention is located on a genome component that has been mapped to human
chromosome 6 (as indicated in Figure 3), which is supported by multiple lines of evidence,
such as STS and BAC map data. Genomic DNA can be analyzed directly or can be amplified
by using PCR prior to analysis. RNA or cDNA can be used in the same way. In some uses, 
detection of the mutation involves the use of a probe/primer in a polymerase chain reaction 
(PCR) (see, e.g. U.S. Patent Nos. 4,683,195 and 4,683,202), such as anchor PCR or RACE 
PCR, or, alternatively, in a ligation chain reaction (LCR) (see, e.g., Landegran et al., Science
241:1077-1080 (1988); and Nakazawa et al., PNAS 91:360-364 (1994)), the latter of which can
be particularly useful for detecting point mutations in the gene (see Abravaya et al., Nucleic
Acids Res. 23:675-682 (1995)). This method can include the steps of collecting a sample of
cells from a patient, isolating nucleic acid (e.g., genomic, mRNA or both) from the cells of the
sample, contacting the nucleic acid sample with one or more primers which specifically
hybridize to a gene under conditions such that hybridization and amplification of the gene (if
present) occurs, and detecting the presence or absence of an amplification product, or detecting
the size of the amplification product and comparing the length to a control sample. Deletions
and insertions can be detected by a change in size of the amplified product compared to the
normal genotype. Point mutations can be identified by hybridizing amplified DNA to normal
RNA or antisense DNA sequences. 
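
As a rough illustration of the size comparison step above, the following Python sketch compares the measured length of an amplification product with that of a normal-genotype control. The function name, the return labels, and the 2 bp measurement tolerance are assumptions made for the example.

```python
def classify_amplicon(sample_bp, control_bp, tolerance_bp=2):
    """Compare an amplified product's length with the normal-genotype control.

    sample_bp is None when no amplification product was detected.
    tolerance_bp is a hypothetical allowance for sizing error.
    """
    if sample_bp is None:
        return "no product"
    delta = sample_bp - control_bp
    if abs(delta) <= tolerance_bp:
        return "size matches control"
    if delta > 0:
        return f"possible insertion (+{delta} bp)"
    return f"possible deletion ({delta} bp)"


print(classify_amplicon(480, 500))   # shorter product than the control
print(classify_amplicon(512, 500))   # longer product than the control
print(classify_amplicon(None, 500))  # amplification failed or target absent
```

A size match does not exclude point mutations, which is why the text pairs this step with hybridization-based detection.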

Alternatively, mutations in a transporter gene can be directly identified, for example, by 
alterations in restriction enzyme digestion patterns determined by gel electrophoresis. 
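
To show why a mutation can alter a restriction digestion pattern, the sketch below computes fragment lengths for a linear sequence cut at each occurrence of a recognition site. The sequences are made up for illustration, and the helper name is hypothetical; the EcoRI site (GAATTC, cut after the first G) is used only as a familiar example.

```python
def digest(seq, site, cut_offset):
    """Return fragment lengths after cutting a linear sequence at every site.

    cut_offset is the number of bases into the recognition site at which
    the enzyme cuts the given strand (1 for EcoRI, G^AATTC).
    """
    seq = seq.upper()
    site = site.upper()
    cuts = []
    start = seq.find(site)
    while start != -1:
        cuts.append(start + cut_offset)
        start = seq.find(site, start + 1)
    bounds = [0] + cuts + [len(seq)]
    return [end - begin for begin, end in zip(bounds, bounds[1:])]


# Toy sequences: the "mutant" differs by one base inside the recognition site.
wild_type = "A" * 10 + "GAATTC" + "A" * 10
mutant = "A" * 10 + "GAGTTC" + "A" * 10

print(digest(wild_type, "GAATTC", 1))  # two fragments
print(digest(mutant, "GAATTC", 1))     # site lost, one uncut fragment
```

On a gel, the wild-type sample would show two bands and the mutant a single larger band, which is the kind of pattern change the paragraph refers to.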

Further, sequence-specific ribozymes (U.S. Patent No. 5,498,531) can be used to score
for the presence of specific mutations by development or loss of a ribozyme cleavage site.
Perfectly matched sequences can be distinguished from mismatched sequences by nuclease 
cleavage digestion assays or by differences in melting temperature. 

Sequence changes at specific locations can also be assessed by nuclease protection 
assays such as RNase and S1 protection or the chemical cleavage method. Furthermore,
sequence differences between a mutant transporter gene and a wild-type gene can be
determined by direct DNA sequencing. A variety of automated sequencing procedures can be
utilized when performing the diagnostic assays (Naeve, C.W., (1995) Biotechniques 19:448),
including sequencing by mass spectrometry (see, e.g., PCT International Publication No. WO
94/16101; Cohen et al., Adv. Chromatogr. 36:127-162 (1996); and Griffin et al., Appl.
Biochem. Biotechnol. 38:147-159 (1993)).

Other methods for detecting mutations in the gene include methods in which protection 
from cleavage agents is used to detect mismatched bases in RNA/RNA or RNA/DNA duplexes 
(Myers et al., Science 230:1242 (1985); Cotton et al., PNAS 85:4397 (1988); Saleeba et al.,
Meth. Enzymol. 217:286-295 (1992)), electrophoretic mobility of mutant and wild type nucleic
acid is compared (Orita et al., PNAS 86:2766 (1989); Cotton et al., Mutat. Res. 285:125-144
(1993); and Hayashi et al., Genet. Anal. Tech. Appl. 9:13-19 (1992)), and movement of mutant
or wild-type fragments in polyacrylamide gels containing a gradient of denaturant is assayed
using denaturing gradient gel electrophoresis (Myers et al., Nature 313:495 (1985)). Examples
of other techniques for detecting point mutations include selective oligonucleotide 
hybridization, selective amplification, and selective primer extension. 

The nucleic acid molecules are also useful for testing an individual for a genotype that,
while not necessarily causing the disease, nevertheless affects the treatment modality. Thus, the
nucleic acid molecules can be used to study the relationship between an individual's genotype
and the individual's response to a compound used for treatment (pharmacogenomic
relationship). Accordingly, the nucleic acid molecules described herein can be used to assess
the mutation content of the transporter gene in an individual in order to select an appropriate
compound or dosage regimen for treatment. Figure 3 provides information on SNPs that have
been identified in a gene encoding the transporter protein of the present invention. 94 SNP
variants were found, including 10 indels (indicated by a "-") and 1 SNP in exons. SNPs,
identified at different nucleotide positions in introns and regions 5' and 3' of the ORF, may
affect control/regulatory elements. 

Thus nucleic acid molecules displaying genetic variations that affect treatment provide a
diagnostic target that can be used to tailor treatment in an individual. Accordingly, the
production of recombinant cells and animals containing these polymorphisms allows effective
clinical design of treatment compounds and dosage regimens. 

The nucleic acid molecules are thus useful as antisense constructs to control transporter 
gene expression in cells, tissues, and organisms. A DNA antisense nucleic acid molecule is
designed to be complementary to a region of the gene involved in transcription, preventing
transcription and hence production of transporter protein. An antisense RNA or DNA nucleic 
acid molecule would hybridize to the mRNA and thus block translation of mRNA into 
transporter protein. 
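
The complementarity that underlies antisense design can be sketched in a few lines of Python. The helper below returns the DNA oligonucleotide, written 5' to 3', that is complementary to a target region of an mRNA or DNA strand; the function name and the short target sequence are illustrative assumptions, and no real transporter sequence is used.

```python
# Maps each base of the target (RNA or DNA) to its DNA complement.
_DNA_COMPLEMENT = str.maketrans("ACGTUacgtu", "TGCAAtgcaa")


def antisense_dna(target):
    """Return the antisense DNA oligo (5'->3') for a target written 5'->3'."""
    # Complement every base, then reverse so the result also reads 5'->3'.
    return target.translate(_DNA_COMPLEMENT)[::-1]


# Hypothetical six-base target region of an mRNA, written 5'->3'.
print(antisense_dna("AUGGCC"))
```

Real antisense design would also weigh target accessibility, oligonucleotide length, and off-target matches, none of which this sketch attempts.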

Alternatively, a class of antisense molecules can be used to inactivate mRNA in order to 
decrease expression of transporter nucleic acid. Accordingly, these molecules can treat a
disorder characterized by abnormal or undesired transporter nucleic acid expression. This
technique involves cleavage by means of ribozymes containing nucleotide sequences
complementary to one or more regions in the mRNA that attenuate the ability of the mRNA to
be translated. Possible regions include coding regions and particularly coding regions
corresponding to the catalytic and other functional activities of the transporter protein, such as 
ligand binding. 

The nucleic acid molecules also provide vectors for gene therapy in patients containing 
cells that are aberrant in transporter gene expression. Thus, recombinant cells, which include
the patient's cells that have been engineered ex vivo and returned to the patient, are introduced 
into an individual where the cells produce the desired transporter protein to treat the individual. 

The invention also encompasses kits for detecting the presence of a transporter nucleic 
acid in a biological sample. Experimental data as provided in Figure 1 indicates that
transporter proteins of the present invention are expressed in the human lung, brain, prostate,
ovary, placenta, thymus, colon, and pancreas. Specifically, the protein is also expressed in
tissues such as small cell carcinoma, liver, neuroblastoma cells, pooled germ cell tumors,
adenocarcinoma, fibrotheoma, and Islets of Langerhans. For
example, the kit can comprise reagents such as a labeled or labelable nucleic acid or agent
capable of detecting transporter nucleic acid in a biological sample; means for determining the
amount of transporter nucleic acid in the sample; and means for comparing the amount of 
transporter nucleic acid in the sample with a standard. The compound or agent can be packaged 
in a suitable container. The kit can further comprise instructions for using the kit to detect 
transporter protein mRNA or DNA. 



Nucleic Acid Arrays 

The present invention further provides nucleic acid detection kits, such as arrays or 
microarrays of nucleic acid molecules that are based on the sequence information provided 
in Figures 1 and 3 (SEQ ID NOS:1 and 3).

As used herein "Arrays" or "Microarrays" refers to an array of distinct
polynucleotides or oligonucleotides synthesized on a substrate, such as paper, nylon or other
type of membrane, filter, chip, glass slide, or any other suitable solid support. In one
embodiment, the microarray is prepared and used according to the methods described in US
Patent 5,837,832, Chee et al., PCT application WO95/11995 (Chee et al.), Lockhart, D. J. et
al. (1996; Nat. Biotech. 14:1675-1680) and Schena, M. et al. (1996; Proc. Natl. Acad. Sci.
93:10614-10619), all of which are incorporated herein in their entirety by reference. In
other embodiments, such arrays are produced by the methods described by Brown et al., US
Patent No. 5,807,522.

The microarray or detection kit is preferably composed of a large number of unique,
single-stranded nucleic acid sequences, usually either synthetic antisense oligonucleotides or 
fragments of cDNAs, fixed to a solid support. The oligonucleotides are preferably about 6-60 
nucleotides in length, more preferably 15-30 nucleotides in length, and most preferably about 
20-25 nucleotides in length. For a certain type of microarray or detection kit, it may be
preferable to use oligonucleotides that are only 7-20 nucleotides in length. The microarray or
detection kit may contain oligonucleotides that cover the known 5' or 3' sequence;
sequential oligonucleotides that cover the full-length sequence; or unique oligonucleotides
selected from particular areas along the length of the sequence. Polynucleotides used in the
microarray or detection kit may be oligonucleotides that are specific to a gene or genes of
interest. 

In order to produce oligonucleotides to a known sequence for a microarray or
detection kit, the gene(s) of interest (or an ORF identified from the contigs of the present
invention) is typically examined using a computer algorithm which starts at the 5' or at the 3'
end of the nucleotide sequence. Typical algorithms will then identify oligomers of defined
length that are unique to the gene, have a GC content within a range suitable for
hybridization, and lack predicted secondary structure that may interfere with hybridization.
In certain situations it may be appropriate to use pairs of oligonucleotides on a microarray or
detection kit. The "pairs" will be identical, except for one nucleotide that preferably is
located in the center of the sequence. The second oligonucleotide in the pair (mismatched by
one) serves as a control. The number of oligonucleotide pairs may range from two to one
million. The oligomers are synthesized at designated areas on a substrate using a light-directed
chemical process. The substrate may be paper, nylon or other type of membrane,
filter, chip, glass slide or any other suitable solid support.
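
The oligomer selection described above can be illustrated with a minimal Python sketch. It is not the algorithm of the invention: the function names, the 20-nucleotide default length, and the 40-60% GC window are assumptions chosen for illustration, uniqueness is checked only within the input sequence rather than against a genome, and the secondary-structure filter is omitted. The control member of each pair is made by substituting the complementary base at the central position.

```python
from collections import Counter

# Watson-Crick complements, used to build the single-base mismatch control.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def gc_content(seq):
    """Fraction of G and C bases in an uppercase DNA string."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def mismatch_control(oligo):
    """Copy of the oligo with its central base replaced by the complement."""
    mid = len(oligo) // 2
    return oligo[:mid] + COMPLEMENT[oligo[mid]] + oligo[mid + 1:]


def design_probe_pairs(gene, length=20, gc_min=0.4, gc_max=0.6):
    """Scan from the 5' end and return (perfect match, mismatch) pairs.

    A window is kept only if it occurs once in `gene` (a simplified
    stand-in for uniqueness to the gene) and its GC content lies in
    the assumed range [gc_min, gc_max].
    """
    windows = [gene[i:i + length] for i in range(len(gene) - length + 1)]
    counts = Counter(windows)
    pairs = []
    for window in windows:
        if counts[window] == 1 and gc_min <= gc_content(window) <= gc_max:
            pairs.append((window, mismatch_control(window)))
    return pairs


if __name__ == "__main__":
    # Hypothetical toy sequence, not taken from SEQ ID NOS:1 or 3.
    for perfect, control in design_probe_pairs("ACGT" * 6):
        print(perfect, control)
```

A production design tool would also screen candidates against the rest of the genome and reject oligomers predicted to form hairpins.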

In another aspect, an oligonucleotide may be synthesized on the surface of the
substrate by using a chemical coupling procedure and an ink jet application apparatus, as
described in PCT application WO95/251116 (Baldeschweiler et al.) which is incorporated
herein in its entirety by reference. In another aspect, a "gridded" array analogous to a dot (or
slot) blot may be used to arrange and link cDNA fragments or oligonucleotides to the surface
of a substrate using a vacuum system, thermal, UV, mechanical or chemical bonding
procedures. An array, such as those described above, may be produced by hand or by using
available devices (slot blot or dot blot apparatus), materials (any suitable solid support), and
machines (including robotic instruments), and may contain 8, 24, 96, 384, 1536, 6144 or
more oligonucleotides, or any other number between two and one million which lends itself
to the efficient use of commercially available instrumentation.

In order to conduct sample analysis using a microarray or detection kit, the RNA or
DNA from a biological sample is made into hybridization probes. The mRNA is isolated,
and cDNA is produced and used as a template to make antisense RNA (aRNA). The aRNA
is amplified in the presence of fluorescent nucleotides, and labeled probes are incubated with
the microarray or detection kit so that the probe sequences hybridize to complementary
oligonucleotides of the microarray or detection kit. Incubation conditions are adjusted so that
hybridization occurs with precise complementary matches or with various degrees of less
complementarity. After removal of nonhybridized probes, a scanner is used to determine the
levels and patterns of fluorescence. The scanned images are examined to determine degree of
complementarity and the relative abundance of each oligonucleotide sequence on the
microarray or detection kit. The biological samples may be obtained from any bodily fluids
(such as blood, urine, saliva, phlegm, gastric juices, etc.), cultured cells, biopsies, or other
tissue preparations. A detection system may be used to measure the absence, presence, and
amount of hybridization for all of the distinct sequences simultaneously. This data may be
used for large-scale correlation studies on the sequences, expression patterns, mutations,
variants, or polymorphisms among samples.
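
One simple way to turn scanned fluorescence values into relative abundances is sketched below in Python. This is an assumed scheme for illustration only, not a method stated in this specification: each probe's signal is taken as the perfect-match intensity minus the intensity of its single-base mismatch control (floored at zero), and signals are then normalized to sum to one. The function names and the example intensities are hypothetical.

```python
def probe_signal(perfect_match, mismatch):
    """Specific signal: perfect-match minus mismatch intensity, never negative."""
    return max(perfect_match - mismatch, 0.0)


def relative_abundance(intensities):
    """Map probe name -> fraction of the total specific signal.

    `intensities` maps a probe name to a (perfect_match, mismatch) tuple
    of fluorescence readings from the scanner.
    """
    signals = {
        name: probe_signal(pm, mm) for name, (pm, mm) in intensities.items()
    }
    total = sum(signals.values())
    if total == 0:
        # No probe hybridized above its control; report zero everywhere.
        return {name: 0.0 for name in signals}
    return {name: signal / total for name, signal in signals.items()}


if __name__ == "__main__":
    # Hypothetical readings in arbitrary fluorescence units.
    readings = {
        "probe_a": (300.0, 100.0),
        "probe_b": (700.0, 100.0),
        "probe_c": (50.0, 80.0),
    }
    print(relative_abundance(readings))
```

A probe whose mismatch control is as bright as its perfect match contributes nothing, which reflects the role of the mismatched oligonucleotide as a control for nonspecific hybridization.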



Using such arrays, the present invention provides methods to identify the expression
of the transporter proteins/peptides of the present invention. In detail, such methods
comprise incubating a test sample with one or more nucleic acid molecules and assaying for
binding of the nucleic acid molecule with components within the test sample. Such assays
will typically involve arrays comprising many genes, at least one of which is a gene of the
present invention and/or alleles of the transporter gene of the present invention. Figure 3
provides information on SNPs that have been identified in a gene encoding the transporter
protein of the present invention. 94 SNP variants were found, including 10 indels (indicated
by a "-") and 1 SNP in exons. SNPs, identified at different nucleotide positions in introns
and regions 5' and 3' of the ORF, may affect control/regulatory elements.

Conditions for incubating a nucleic acid molecule with a test sample vary.
Incubation conditions depend on the format employed in the assay, the detection methods
employed, and the type and nature of the nucleic acid molecule used in the assay. One
skilled in the art will recognize that any one of the commonly available hybridization,
amplification or array assay formats can readily be adapted to employ the novel fragments of
the Human genome disclosed herein. Examples of such assays can be found in Chard, T., An
Introduction to Radioimmunoassay and Related Techniques, Elsevier Science Publishers,
Amsterdam, The Netherlands (1986); Bullock, G. R. et al., Techniques in
Immunocytochemistry, Academic Press, Orlando, FL Vol. 1 (1982), Vol. 2 (1983), Vol. 3
(1985); Tijssen, P., Practice and Theory of Enzyme Immunoassays: Laboratory Techniques
in Biochemistry and Molecular Biology, Elsevier Science Publishers, Amsterdam, The
Netherlands (1985).

The test samples of the present invention include cells and protein or membrane extracts
of cells. The test sample used in the above-described method will vary based on the assay
format, nature of the detection method and the tissues, cells or extracts used as the sample to
be assayed. Methods for preparing nucleic acid extracts of cells are well known in the art
and can readily be adapted in order to obtain a sample that is compatible with the system
utilized.

In another embodiment of the present invention, kits are provided which contain the
necessary reagents to carry out the assays of the present invention.

Specifically, the invention provides a compartmentalized kit to receive, in close
confinement, one or more containers which comprises: (a) a first container comprising one of
the nucleic acid molecules that can bind to a fragment of the Human genome disclosed
herein; and (b) one or more other containers comprising one or more of the following: wash
reagents and reagents capable of detecting the presence of a bound nucleic acid.

In detail, a compartmentalized kit includes any kit in which reagents are contained in
separate containers. Such containers include small glass containers, plastic containers, strips
of plastic, glass or paper, or arraying material such as silica. Such containers allow one to
efficiently transfer reagents from one compartment to another compartment such that the
samples and reagents are not cross-contaminated, and the agents or solutions of each
container can be added in a quantitative fashion from one compartment to another. Such
containers will include a container which will accept the test sample, a container which
contains the nucleic acid probe, containers which contain wash reagents (such as phosphate
buffered saline, Tris-buffers, etc.), and containers which contain the reagents used to detect
the bound probe. One skilled in the art will readily recognize that the previously unidentified
transporter gene of the present invention can be routinely identified using the sequence
information disclosed herein and can be readily incorporated into one of the established kit
formats which are well known in the art, particularly expression arrays.

Vectors/host cells

The invention also provides vectors containing the nucleic acid molecules described
herein. The term "vector" refers to a vehicle, preferably a nucleic acid molecule, which can
transport the nucleic acid molecules. When the vector is a nucleic acid molecule, the nucleic
acid molecules are covalently linked to the vector nucleic acid. With this aspect of the
invention, the vector includes a plasmid, single or double stranded phage, a single or double
stranded RNA or DNA viral vector, or artificial chromosome, such as a BAC, PAC, YAC, or
MAC.

A vector can be maintained in the host cell as an extrachromosomal element where it
replicates and produces additional copies of the nucleic acid molecules. Alternatively, the
vector may integrate into the host cell genome and produce additional copies of the nucleic acid
molecules when the host cell replicates.

The invention provides vectors for the maintenance (cloning vectors) or vectors for
expression (expression vectors) of the nucleic acid molecules. The vectors can function in
prokaryotic or eukaryotic cells or in both (shuttle vectors).

Expression vectors contain cis-acting regulatory regions that are operably linked in the
vector to the nucleic acid molecules such that transcription of the nucleic acid molecules is
allowed in a host cell. The nucleic acid molecules can be introduced into the host cell with a
separate nucleic acid molecule capable of affecting transcription. Thus, the second nucleic acid
molecule may provide a trans-acting factor interacting with the cis-regulatory control region to
allow transcription of the nucleic acid molecules from the vector. Alternatively, a trans-acting
factor may be supplied by the host cell. Finally, a trans-acting factor can be produced from the
vector itself. It is understood, however, that in some embodiments, transcription and/or
translation of the nucleic acid molecules can occur in a cell-free system.

The regulatory sequences to which the nucleic acid molecules described herein can be
operably linked include promoters for directing mRNA transcription. These include, but are not
limited to, the left promoter from bacteriophage λ, the lac, TRP, and TAC promoters from E.
coli, the early and late promoters from SV40, the CMV immediate early promoter, the
adenovirus early and late promoters, and retrovirus long-terminal repeats.

In addition to control regions that promote transcription, expression vectors may also
include regions that modulate transcription, such as repressor binding sites and enhancers.
Examples include the SV40 enhancer, the cytomegalovirus immediate early enhancer, polyoma
enhancer, adenovirus enhancers, and retrovirus LTR enhancers.

In addition to containing sites for transcription initiation and control, expression vectors
can also contain sequences necessary for transcription termination and, in the transcribed region,
a ribosome binding site for translation. Other regulatory control elements for expression
include initiation and termination codons as well as polyadenylation signals. The person of
ordinary skill in the art would be aware of the numerous regulatory sequences that are useful in
expression vectors. Such regulatory sequences are described, for example, in Sambrook et al.,
Molecular Cloning: A Laboratory Manual. 2nd ed., Cold Spring Harbor Laboratory Press,
Cold Spring Harbor, NY, (1989).

A variety of expression vectors can be used to express a nucleic acid molecule. Such
vectors include chromosomal, episomal, and virus-derived vectors, for example vectors derived
from bacterial plasmids, from bacteriophage, from yeast episomes, from yeast chromosomal
elements, including yeast artificial chromosomes, from viruses such as baculoviruses,
papovaviruses such as SV40, Vaccinia viruses, adenoviruses, poxviruses, pseudorabies viruses,
and retroviruses. Vectors may also be derived from combinations of these sources such as those
derived from plasmid and bacteriophage genetic elements, e.g. cosmids and phagemids.
Appropriate cloning and expression vectors for prokaryotic and eukaryotic hosts are described
in Sambrook et al., Molecular Cloning: A Laboratory Manual. 2nd ed., Cold Spring Harbor
Laboratory Press, Cold Spring Harbor, NY, (1989).

The regulatory sequence may provide constitutive expression in one or more host cells
(i.e. tissue specific) or may provide for inducible expression in one or more cell types such as
by temperature, nutrient additive, or exogenous factor such as a hormone or other ligand. A
variety of vectors providing for constitutive and inducible expression in prokaryotic and
eukaryotic hosts are well known to those of ordinary skill in the art.

The nucleic acid molecules can be inserted into the vector nucleic acid by well-known
methodology. Generally, the DNA sequence that will ultimately be expressed is joined to an
expression vector by cleaving the DNA sequence and the expression vector with one or more
restriction enzymes and then ligating the fragments together. Procedures for restriction enzyme
digestion and ligation are well known to those of ordinary skill in the art.
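
The cut-and-ligate step can be pictured with a small in-silico sketch in Python. It models only the top strand and assumes the enzyme EcoRI, which recognizes GAATTC and cuts after the first G; the choice of enzyme, the function names, and the toy vector and insert sequences are illustrative assumptions and do not come from this specification.

```python
ECORI_SITE = "GAATTC"
ECORI_CUT_OFFSET = 1  # EcoRI cuts between G and AATTC on the top strand


def digest(seq, site=ECORI_SITE, offset=ECORI_CUT_OFFSET):
    """Split a top-strand sequence at every occurrence of the site."""
    fragments = []
    start = 0
    pos = seq.find(site)
    while pos != -1:
        cut = pos + offset
        fragments.append(seq[start:cut])
        start = cut
        pos = seq.find(site, pos + 1)
    fragments.append(seq[start:])
    return fragments


def ligate(left_arm, insert, right_arm):
    """Join compatible fragments end to end (top strand only)."""
    return left_arm + insert + right_arm


if __name__ == "__main__":
    # Hypothetical linearized vector with a single EcoRI site.
    vector = "TTTGAATTCAAA"
    # Hypothetical source DNA with the insert flanked by two EcoRI sites.
    source = "CCGAATTCATGAAAGAATTCGG"

    left_arm, right_arm = digest(vector)
    _, insert, _ = digest(source)
    recombinant = ligate(left_arm, insert, right_arm)
    print(recombinant)
```

Because both ends of the insert and the vector arms were produced by the same enzyme, the recognition site is regenerated at each junction of the recombinant molecule.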

The vector containing the appropriate nucleic acid molecule can be introduced into an
appropriate host cell for propagation or expression using well-known techniques. Bacterial
cells include, but are not limited to, E. coli, Streptomyces, and Salmonella typhimurium.
Eukaryotic cells include, but are not limited to, yeast, insect cells such as Drosophila, animal
cells such as COS and CHO cells, and plant cells.

As described herein, it may be desirable to express the peptide as a fusion protein.
Accordingly, the invention provides fusion vectors that allow for the production of the peptides.
Fusion vectors can increase the expression of a recombinant protein, increase the solubility of
the recombinant protein, and aid in the purification of the protein by acting for example as a
ligand for affinity purification. A proteolytic cleavage site may be introduced at the junction of
the fusion moiety so that the desired peptide can ultimately be separated from the fusion moiety.
Proteolytic enzymes include, but are not limited to, factor Xa, thrombin, and enterokinase.
Typical fusion expression vectors include pGEX (Smith et al., Gene 67:31-40 (1988)), pMAL
(New England Biolabs, Beverly, MA) and pRIT5 (Pharmacia, Piscataway, NJ) which fuse
glutathione S-transferase (GST), maltose E binding protein, or protein A, respectively, to the
target recombinant protein. Examples of suitable inducible non-fusion E. coli expression
vectors include pTrc (Amann et al., Gene 69:301-315 (1988)) and pET 11d (Studier et al., Gene
Expression Technology: Methods in Enzymology 185:60-89 (1990)).

Recombinant protein expression can be maximized in host bacteria by providing a
genetic background wherein the host cell has an impaired capacity to proteolytically cleave the
recombinant protein. (Gottesman, S., Gene Expression Technology: Methods in Enzymology
185, Academic Press, San Diego, California (1990) 119-128). Alternatively, the sequence of
the nucleic acid molecule of interest can be altered to provide preferential codon usage for a
specific host cell, for example E. coli. (Wada et al., Nucleic Acids Res. 20:2111-2118 (1992)).
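
Altering a coding sequence for preferential codon usage can be sketched in Python as a synonymous codon swap. The standard genetic code below is built from the conventional TCAG ordering; the `PREFERRED_CODON` table is an illustrative assumption (one plausible high-usage codon per amino acid for E. coli) and should be replaced with values from a measured codon-usage table for the actual host. The function names and the example sequence are hypothetical.

```python
from itertools import product

# Standard genetic code in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# ASSUMPTION: illustrative preferred codons, not measured data.
PREFERRED_CODON = {
    "A": "GCG", "R": "CGT", "N": "AAC", "D": "GAT", "C": "TGC",
    "Q": "CAG", "E": "GAA", "G": "GGC", "H": "CAT", "I": "ATT",
    "L": "CTG", "K": "AAA", "M": "ATG", "F": "TTT", "P": "CCG",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAT", "V": "GTG",
    "*": "TAA",
}


def codons(cds):
    """Split an in-frame coding sequence into codons."""
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return [cds[i:i + 3] for i in range(0, len(cds), 3)]


def translate(cds):
    """Translate a coding sequence with the standard genetic code."""
    return "".join(CODON_TABLE[codon] for codon in codons(cds))


def optimize_codons(cds):
    """Replace each codon with the preferred synonymous codon."""
    return "".join(PREFERRED_CODON[CODON_TABLE[codon]] for codon in codons(cds))


if __name__ == "__main__":
    original = "ATGCTAAGAGGATAG"  # hypothetical toy coding sequence
    optimized = optimize_codons(original)
    print(optimized, translate(optimized) == translate(original))
```

The swap changes the nucleotide sequence but leaves the encoded amino acid sequence unchanged, which is the property the passage relies on.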

The nucleic acid molecules can also be expressed by expression vectors that are
operative in yeast. Examples of vectors for expression in yeast, e.g., S. cerevisiae, include
pYepSec1 (Baldari, et al., EMBO J. 6:229-234 (1987)), pMFa (Kurjan et al., Cell 30:933-943
(1982)), pJRY88 (Schultz et al., Gene 54:113-123 (1987)), and pYES2 (Invitrogen
Corporation, San Diego, CA).

The nucleic acid molecules can also be expressed in insect cells using, for example,
baculovirus expression vectors. Baculovirus vectors available for expression of proteins in
cultured insect cells (e.g., Sf9 cells) include the pAc series (Smith et al., Mol. Cell Biol.
3:2156-2165 (1983)) and the pVL series (Lucklow et al., Virology 170:31-39 (1989)).

In certain embodiments of the invention, the nucleic acid molecules described herein are
expressed in mammalian cells using mammalian expression vectors. Examples of mammalian
expression vectors include pCDM8 (Seed, B. Nature 329:840 (1987)) and pMT2PC (Kaufman
et al., EMBO J. 6:187-195 (1987)).

The expression vectors listed herein are provided by way of example only of the well-known
vectors available to those of ordinary skill in the art that would be useful to express the
nucleic acid molecules. The person of ordinary skill in the art would be aware of other vectors
suitable for maintenance, propagation, or expression of the nucleic acid molecules described
herein. These are found for example in Sambrook, J., Fritsch, E. F., and Maniatis, T. Molecular
Cloning: A Laboratory Manual. 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor
Laboratory Press, Cold Spring Harbor, NY, 1989.

The invention also encompasses vectors in which the nucleic acid sequences described
herein are cloned into the vector in reverse orientation, but operably linked to a regulatory
sequence that permits transcription of antisense RNA. Thus, an antisense transcript can be
produced to all, or to a portion, of the nucleic acid molecule sequences described herein,
including both coding and non-coding regions. Expression of this antisense RNA is subject to
each of the parameters described above in relation to expression of the sense RNA (regulatory
sequences, constitutive or inducible expression, tissue-specific expression).
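
The effect of cloning a sequence in reverse orientation can be shown with a short Python sketch. Inserting the fragment backwards places the reverse complement of the coding strand downstream of the promoter, so the transcript is complementary to the sense mRNA. The function names and the nine-nucleotide example are hypothetical and are not drawn from SEQ ID NOS:1 or 3.

```python
# Translation table mapping each DNA base to its Watson-Crick partner.
DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of an uppercase DNA string."""
    return dna.translate(DNA_COMPLEMENT)[::-1]


def transcribe(coding_strand):
    """RNA with the same sequence as the given coding strand (T -> U)."""
    return coding_strand.replace("T", "U")


def antisense_transcript(sense_coding_strand):
    """Transcript produced when the insert is cloned in reverse orientation."""
    return transcribe(reverse_complement(sense_coding_strand))


if __name__ == "__main__":
    sense_dna = "ATGGCCAAG"  # hypothetical fragment of a coding strand
    print("sense mRNA:    ", transcribe(sense_dna))
    print("antisense RNA: ", antisense_transcript(sense_dna))
```

Taking the reverse complement twice returns the original sequence, which is why the antisense transcript can base-pair with the sense mRNA along its full length.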

The invention also relates to recombinant host cells containing the vectors described 
herein. Host cells therefore include prokaryotic cells, lower eukaryotic cells such as yeast, other 
eukaryotic cells such as insect cells, and higher eukaryotic cells such as mammalian cells. 

The recombinant host cells are prepared by introducing the vector constructs described
herein into the cells by techniques readily available to the person of ordinary skill in the art.
These include, but are not limited to, calcium phosphate transfection, DEAE-dextran-mediated
transfection, cationic lipid-mediated transfection, electroporation, transduction, infection,
lipofection, and other techniques such as those found in Sambrook, et al. (Molecular Cloning:
A Laboratory Manual. 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor
Laboratory Press, Cold Spring Harbor, NY, 1989).

Host cells can contain more than one vector. Thus, different nucleotide sequences can
be introduced on different vectors of the same cell. Similarly, the nucleic acid molecules can be
introduced either alone or with other nucleic acid molecules that are not related to the nucleic
acid molecules such as those providing trans-acting factors for expression vectors. When more
than one vector is introduced into a cell, the vectors can be introduced independently,
co-introduced or joined to the nucleic acid molecule vector.

In the case of bacteriophage and viral vectors, these can be introduced into cells as
packaged or encapsulated virus by standard procedures for infection and transduction. Viral
vectors can be replication-competent or replication-defective. In the case in which viral
replication is defective, replication will occur in host cells providing functions that complement
the defects.

Vectors generally include selectable markers that enable the selection of the
subpopulation of cells that contain the recombinant vector constructs. The marker can be
contained in the same vector that contains the nucleic acid molecules described herein or may
be on a separate vector. Markers include tetracycline or ampicillin-resistance genes for
prokaryotic host cells and dihydrofolate reductase or neomycin resistance for eukaryotic host
cells. However, any marker that provides selection for a phenotypic trait will be effective.

While the mature proteins can be produced in bacteria, yeast, mammalian cells, and
other cells under the control of the appropriate regulatory sequences, cell-free transcription and
translation systems can also be used to produce these proteins using RNA derived from the
DNA constructs described herein.

Where secretion of the peptide is desired, which is difficult to achieve with
multi-transmembrane domain containing proteins such as transporters, appropriate secretion signals
are incorporated into the vector. The signal sequence can be endogenous to the peptides or
heterologous to these peptides.

Where the peptide is not secreted into the medium, which is typically the case with
transporters, the protein can be isolated from the host cell by standard disruption procedures,
including freeze thaw, sonication, mechanical disruption, use of lysing agents and the like. The
peptide can then be recovered and purified by well-known purification methods including
ammonium sulfate precipitation, acid extraction, anion or cationic exchange chromatography,
phosphocellulose chromatography, hydrophobic-interaction chromatography, affinity
chromatography, hydroxylapatite chromatography, lectin chromatography, or high performance
liquid chromatography.

It is also understood that depending upon the host cell in recombinant production of the
peptides described herein, the peptides can have various glycosylation patterns, depending upon
the cell, or may be non-glycosylated as when produced in bacteria. In addition, the peptides
may include an initial modified methionine in some cases as a result of a host-mediated process.

Uses of vectors and host cells

The recombinant host cells expressing the peptides described herein have a variety of
uses. First, the cells are useful for producing a transporter protein or peptide that can be further
purified to produce desired amounts of transporter protein or fragments. Thus, host cells
containing expression vectors are useful for peptide production.

Host cells are also useful for conducting cell-based assays involving the transporter
protein or transporter protein fragments, such as those described above as well as other formats
known in the art. Thus, a recombinant host cell expressing a native transporter protein is useful
for assaying compounds that stimulate or inhibit transporter protein function.

Host cells are also useful for identifying transporter protein mutants in which these
functions are affected. If the mutants naturally occur and give rise to a pathology, host cells
containing the mutations are useful to assay compounds that have a desired effect on the mutant
transporter protein (for example, stimulating or inhibiting function) which may not be indicated
by their effect on the native transporter protein.

Genetically engineered host cells can be further used to produce non-human transgenic
animals. A transgenic animal is preferably a mammal, for example a rodent, such as a rat or
mouse, in which one or more of the cells of the animal include a transgene. A transgene is
exogenous DNA that is integrated into the genome of a cell from which a transgenic animal
develops and which remains in the genome of the mature animal in one or more cell types or
tissues of the transgenic animal. These animals are useful for studying the function of a
transporter protein and identifying and evaluating modulators of transporter protein activity.
Other examples of transgenic animals include non-human primates, sheep, dogs, cows, goats,
chickens, and amphibians.

A transgenic animal can be produced by introducing nucleic acid into the male
pronuclei of a fertilized oocyte, e.g., by microinjection or retroviral infection, and allowing the
oocyte to develop in a pseudopregnant female foster animal. Any of the transporter protein
nucleotide sequences can be introduced as a transgene into the genome of a non-human animal,
such as a mouse.

Any of the regulatory or other sequences useful in expression vectors can form part of
the transgenic sequence. This includes intronic sequences and polyadenylation signals, if not
already included. A tissue-specific regulatory sequence(s) can be operably linked to the
transgene to direct expression of the transporter protein to particular cells.

Methods for generating transgenic animals via embryo manipulation and microinjection,
particularly animals such as mice, have become conventional in the art and are described, for
example, in U.S. Patent Nos. 4,736,866 and 4,870,009, both by Leder et al., U.S. Patent No.
4,873,191 by Wagner et al. and in Hogan, B., Manipulating the Mouse Embryo, (Cold Spring
Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986). Similar methods are used for
production of other transgenic animals. A transgenic founder animal can be identified based
upon the presence of the transgene in its genome and/or expression of transgenic mRNA in
tissues or cells of the animals. A transgenic founder animal can then be used to breed additional
animals carrying the transgene. Moreover, transgenic animals carrying a transgene can further
be bred to other transgenic animals carrying other transgenes. A transgenic animal also
includes animals in which the entire animal or tissues in the animal have been produced using
the homologously recombinant host cells described herein.

In another embodiment, transgenic non-human animals can be produced which contain
selected systems that allow for regulated expression of the transgene. One example of such a
system is the cre/loxP recombinase system of bacteriophage P1. For a description of the
cre/loxP recombinase system, see, e.g., Lakso et al. PNAS 89:6232-6236 (1992). Another
example of a recombinase system is the FLP recombinase system of S. cerevisiae (O'Gorman et
al. Science 251:1351-1355 (1991)). If a cre/loxP recombinase system is used to regulate
expression of the transgene, animals containing transgenes encoding both the Cre recombinase
and a selected protein are required. Such animals can be provided through the construction of
"double" transgenic animals, e.g., by mating two transgenic animals, one containing a transgene
encoding a selected protein and the other containing a transgene encoding a recombinase.

Clones of the non-human transgenic animals described herein can also be produced
according to the methods described in Wilmut, I. et al. Nature 385:810-813 (1997) and PCT
International Publication Nos. WO 97/07668 and WO 97/07669. In brief, a cell, e.g., a somatic
cell, from the transgenic animal can be isolated and induced to exit the growth cycle and enter
G0 phase. The quiescent cell can then be fused, e.g., through the use of electrical pulses, to an
enucleated oocyte from an animal of the same species from which the quiescent cell is isolated.
The reconstructed oocyte is then cultured such that it develops to morula or blastocyst and then
transferred to a pseudopregnant female foster animal. The offspring born of this female foster
animal will be a clone of the animal from which the cell, e.g., the somatic cell, is isolated.

Transgenic animals containing recombinant cells that express the peptides described
herein are useful to conduct the assays described herein in an in vivo context. Accordingly, the
various physiological factors that are present in vivo and that could affect ligand binding,
transporter protein activation, and signal transduction, may not be evident from in vitro cell-free
or cell-based assays. Accordingly, it is useful to provide non-human transgenic animals to
assay in vivo transporter protein function, including ligand interaction, the effect of specific
mutant transporter proteins on transporter protein function and ligand interaction, and the effect
of chimeric transporter proteins. It is also possible to assess the effect of null mutations, that is
mutations that substantially or completely eliminate one or more transporter protein functions.

All publications and patents mentioned in the above specification are herein 
incorporated by reference. Various modifications and variations of the described method 
and system of the invention will be apparent to those skilled in the art without departing 
from the scope and spirit of the invention. Although the invention has been described in 
connection with specific preferred embodiments, it should be understood that the invention 
as claimed should not be unduly limited to such specific embodiments. Indeed, various 
modifications of the above-described modes for carrying out the invention which are obvious 
to those skilled in the field of molecular biology or related fields are intended to be within 
the scope of the following claims.

Claims



That which is claimed is: 

1. An isolated peptide consisting of an amino acid sequence selected from the
group consisting of: 

(a) an amino acid sequence shown in SEQ ID NO:2; 

(b) an amino acid sequence of an allelic variant of an amino acid sequence 
shown in SEQ ID NO:2, wherein said allelic variant is encoded by a nucleic acid molecule that 
hybridizes under stringent conditions to the opposite strand of a nucleic acid molecule shown in 
SEQ ID NOS:1 or 3;

(c) an amino acid sequence of an ortholog of an amino acid sequence shown 
in SEQ ID NO:2, wherein said ortholog is encoded by a nucleic acid molecule that hybridizes 
under stringent conditions to the opposite strand of a nucleic acid molecule shown in SEQ ID 
NOS:1 or 3; and

(d) a fragment of an amino acid sequence shown in SEQ ID NO:2, wherein 
said fragment comprises at least 10 contiguous amino acids. 

2. An isolated peptide comprising an amino acid sequence selected from the group 
consisting of: 

(a) an amino acid sequence shown in SEQ ID NO:2; 

(b) an amino acid sequence of an allelic variant of an amino acid sequence 
shown in SEQ ID NO:2, wherein said allelic variant is encoded by a nucleic acid molecule that 
hybridizes under stringent conditions to the opposite strand of a nucleic acid molecule shown in 
SEQ ID NOS:1 or 3;

(c) an amino acid sequence of an ortholog of an amino acid sequence shown 
in SEQ ID NO:2, wherein said ortholog is encoded by a nucleic acid molecule that hybridizes 
under stringent conditions to the opposite strand of a nucleic acid molecule shown in SEQ ID 
NOS:1 or 3; and

(d) a fragment of an amino acid sequence shown in SEQ ID NO:2, wherein 
said fragment comprises at least 10 contiguous amino acids. 

3. An isolated antibody that selectively binds to a peptide of claim 2.

4. An isolated nucleic acid molecule consisting of a nucleotide sequence selected
from the group consisting of:

(a) a nucleotide sequence that encodes an amino acid sequence shown in
SEQ ID NO:2;

(b) a nucleotide sequence that encodes an allelic variant of an amino acid
sequence shown in SEQ ID NO:2, wherein said nucleotide sequence hybridizes under stringent 
conditions to the opposite strand of a nucleic acid molecule shown in SEQ ID NOS: 1 or 3; 

(c) a nucleotide sequence that encodes an ortholog of an amino acid 
sequence shown in SEQ ID NO:2, wherein said nucleotide sequence hybridizes under stringent 
conditions to the opposite strand of a nucleic acid molecule shown in SEQ ID NOS: 1 or 3; 

(d) a nucleotide sequence that encodes a fragment of an amino acid 
sequence shown in SEQ ID NO:2, wherein said fragment comprises at least 10 contiguous 
amino acids; and 

(e) a nucleotide sequence that is the complement of a nucleotide sequence
of (a)-(d).

5. An isolated nucleic acid molecule comprising a nucleotide sequence selected
from the group consisting of: 

(a) a nucleotide sequence that encodes an amino acid sequence shown in
SEQ ID NO:2;

(b) a nucleotide sequence that encodes an allelic variant of an amino acid
sequence shown in SEQ ID NO:2, wherein said nucleotide sequence hybridizes under stringent 
conditions to the opposite strand of a nucleic acid molecule shown in SEQ ID NOS: 1 or 3; 

(c) a nucleotide sequence that encodes an ortholog of an amino acid 
sequence shown in SEQ ID NO:2, wherein said nucleotide sequence hybridizes under stringent 
conditions to the opposite strand of a nucleic acid molecule shown in SEQ ID NOS: 1 or 3; 

(d) a nucleotide sequence that encodes a fragment of an amino acid 
sequence shown in SEQ ID NO:2, wherein said fragment comprises at least 10 contiguous 
amino acids; and 

(e) a nucleotide sequence that is the complement of a nucleotide sequence
of (a)-(d).



6. A gene chip comprising a nucleic acid molecule of claim 5. 
7. A transgenic non-human animal comprising a nucleic acid molecule of claim 5.

8. A nucleic acid vector comprising a nucleic acid molecule of claim 5. 

9. A host cell containing the vector of claim 8. 

10. A method for producing any of the peptides of claim 1 comprising introducing a 
nucleotide sequence encoding any of the amino acid sequences in (a)-(d) into a host cell, and 
culturing the host cell under conditions in which the peptides are expressed from the nucleotide 
sequence. 

11. A method for producing any of the peptides of claim 2 comprising introducing a 
nucleotide sequence encoding any of the amino acid sequences in (a)-(d) into a host cell, and 
culturing the host cell under conditions in which the peptides are expressed from the nucleotide 
sequence. 

12. A method for detecting the presence of any of the peptides of claim 2 in a 
sample, said method comprising contacting said sample with a detection agent that specifically 
allows detection of the presence of the peptide in the sample and then detecting the presence of 
the peptide. 

13. A method for detecting the presence of a nucleic acid molecule of claim 5 in a 
sample, said method comprising contacting the sample with an oligonucleotide that hybridizes 
to said nucleic acid molecule under stringent conditions and determining whether the 
oligonucleotide binds to said nucleic acid molecule in the sample. 

14. A method for identifying a modulator of a peptide of claim 2, said method 
comprising contacting said peptide with an agent and determining if said agent has modulated 
the function or activity of said peptide. 

15. The method of claim 14, wherein said agent is administered to a host cell 
comprising an expression vector that expresses said peptide. 
16. A method for identifying an agent that binds to any of the peptides of claim 2,
said method comprising contacting the peptide with an agent and assaying the contacted 
mixture to determine whether a complex is formed with the agent bound to the peptide. 

17. A pharmaceutical composition comprising an agent identified by the method of 
claim 16 and a pharmaceutically acceptable carrier therefor. 

18. A method for treating a disease or condition mediated by a human transporter 
protein, said method comprising administering to a patient a pharmaceutically effective amount 
of an agent identified by the method of claim 16. 

19. A method for identifying a modulator of the expression of a peptide of claim 2, 
said method comprising contacting a cell expressing said peptide with an agent, and 
determining if said agent has modulated the expression of said peptide. 

20. An isolated human transporter peptide having an amino acid sequence that 
shares at least 70% homology with an amino acid sequence shown in SEQ ID NO:2. 

21. A peptide according to claim 20 that shares at least 90 percent homology with 
an amino acid sequence shown in SEQ ID NO:2. 

22. An isolated nucleic acid molecule encoding a human transporter peptide, said 
nucleic acid molecule sharing at least 80 percent homology with a nucleic acid molecule shown 
in SEQ ID NOS: 1 or 3.

23. A nucleic acid molecule according to claim 22 that shares at least 90 percent 
homology with a nucleic acid molecule shown in SEQ ID NOS: 1 or 3. 
1 ATOGTOCTCT CCCAGGAGGA GOOGGACTCC GCGCGGGGCA GGAGC3G4GGC
51 GCAGCGGCTC GGOQCOGCGC CCACGGGGGC GGCTX30GCOG CCCGGCCCGG 
101 OVGGCTOGG* CAGOCCGGAG GCQGCTGTCG AG^AGGTGGA GGTGGAGCTG 
151 GOQQGQCGQG OGA00G0GGA GCGCCATCAG CCCCCGGAAC GGGCGGAGGG 
201 GQGCTGQQGC TGGCTGGTG* TGCPGGGGGC CATCTGOTGC AAOQQOTCGG 
251 TCTTOGGCAT CCAGAACGCT TCC3QQQOTQC TCTTOGTCTC CATCCPQGAA 
301 ACCTTCQGCT CCAAAGACG* TGACAAGATG GTCTTTAAGA CAQCAGCATG 
351 GOTAGGTTCT CTCTOCATOG GGMGATTTT CTTTTCCTGC CCAATAGTOV 
401 GGGTCTTCAC AGAGCTATTT GCjI IGKjGGA AAACAGCTGT CGTGGOTGCT 
451 GCTUTTCOVT TTOTTGQGCT CATCTTXAGT IUIIIGIAA 6TTCCATOGA 
501 GCCTCTOTAC CTTACXTATC GAATCATATT TCCCTGGGGC TCLKXI I PG 
551 CATAOCAGOC TTTATTOGTC Al 1 1 PGQGAC ACTATTTCAA GAAGCGCCTT 
601 GGAGPGGTG* ATGGCATTCT CACTCCPQGC AGCAGTGTCT TC ACAAT QCT 
651 GCTOOCTTTC CTCTTAAGQG TTC TGATTGA CAGGGTOQQC CIU I MACA 
701 CATTGAGGCT GCTCTCCATC TTCATCTTTC TTCTCTTTCT GGCTGGCTTT 
751 ACTTAOGGAC CIU IU TAC CAOTACCAAA GATAAAGAGA GTGGAGGTAG 
801 GQGATOCTOC CTCTTTTOCA GGAAAAAGTT QVGTCCTOCA AAAAAAATTT 
851 TCAATTTTGC CATCTTCAAG GTGACAGCTT ATGCAGTGTG GGCAGTTCGA 
901 ATAOCACTTG CA LI I I I ICG ATACTTTCTC CCTTAIGI IC ACTTGATC AA 
951 AOVTOTAAAT GAAAGATTTC AAGATCAAAA AAATAAAGAG GTTCTTCTCA 
1001 TCTGCATTCG CXTTCACTTCA GGAGTTGGAC GACIGUU I TX5GCOGGATT 
1051 GONGMTATG TGGUGGTCT GAAGAAGGTT TATCTACAGG TACPCTOCTT 
1101 TTTCTTCATT GGTCPGATGT CCATCATGAT TCCICTGTGT AGCATCTTTG 
1151 GGGOOCTCAT TCCTGTGTCC CTCATCATCG GVL\U ICJGA TCGATGCTTC 
1201 ATTTCCATTA TGGCPCGDVT AGCCTTTGAG TTAGTTOGTG CCCAGGATCT 
1251 CTOOCAAGCA ATTGGATTTC TGCTCGGATT CATCTCTATA GGCATGACTG 
1301 TTGGCGCAGC CATTCCAGGG TTACmCGTG ACAAACTGGG CTCCTATGAT 
1351 GTGGCATTCT AGCTOGCTCG ACVCCCTCCC CTTATTOGAG GTGCTGTGCT 
1401 TTCTTTTATC GQGTGGMTX ATAGTAAGAA GCAAAGAGAG ATCAGTAAAA 
1451 CCACTGGAAA ACWVAAGA7G GAGAAAATGT TGGAAAAGCA GAACTCTCTG 
1501 CTCTCAAGTT CATCTCGAAT GTTCAAGAAA GAATCTGACT CTATTATTTA 
1551 A (SEQ ID NO: 1)
1 MVLSQEEPD5 ARGT5EAQPL GPAFPGAAPP PGPGPSDSPE AAVEKVEVEL
51 AGPATAEPHE PPEPPBQGWG M_\M^AMWC NGSVFGIQNA CGVLFVSMLE 
101 TFGSKDOOKM VFKTAAWVGS LSMG^TFFCC PIVSVFTDLF GCRKTAWGA 
151 AVGFVQMSS SFVSSIEPLY LTYGIIFAGG CSFAYQP5LV ILGHYFKKRL 
201 GLS/NGF/TAG SSVFTTLLPL LLRVLDDSVG LFYTLRVLd FMFVLFLAGF 
251 TYRPLAT5TK DKESGGSGSS LF5RKKFSPP KKIFNFAIFK VTAYAVWAVG 
301 IPLALFGYFV PYVHLMKHVN ERFQPEKNKE WLMCIGVTS GVGRLLFGRI 
351 ADYVFGVKKV YLQVLSFFFI QMSNMIPLC SIFGALIAVC LIMGLFDGCF 
401 ISimPIAFE LVGAQPVSQ* IGFLLGFMSI PMTVGPP1AG LLRDKLG5YD 
451 VAFYLAGVPP LIQGAVLCFI PWIH5KKQRE ISKTTGKEKM EKMLENQNSL 
501 LSSSSGMFKK ESDSH (SEQ ID NO: 2) 

FEATURES: 

Functional domains and key regions: 
PDOC00001 PS00001 ASN_GLYCOSYLATION
N-glycosylation site
81-84 NGSV

PDOC00002 PS00002 GLYCOSAMINOGLYCAN
Glycosaminoglycan attachment site
340-343 SGVG

PDOC00004 PS00004 CAMP_PHOSPHO_SITE
cAMP- and cGMP-dependent protein kinase phosphorylation site
Number of matches: 2
1 275-278 KKFS
2 509-512 KKES

PDOC00005 PS00005 PKC_PHOSPHO_SITE
Protein kinase C phosphorylation site
Number of matches: 7
1 10-12 SAR
2 234-236 TLR
3 251-253 TYR
4 258-260 STK
5 273-275 SRK
6 475-477 SKK
7 485-487 TGK

PDOC00006 PS00006 CK2_PHOSPHO_SITE
Casein kinase II phosphorylation site
Number of matches: 6
1 4-7 SQEE
2 97-100 SMLE
3 104-107 SKDD
4 164-167 SSIE
5 258-261 STKD
6 485-488 TGKE

PDOC00008 PS00008 MYRISTYL
N-myristoylation site
Number of matches: 15
1 13-18 GTSEAQ
2 82-87 GSVFGI
3 141-146 GCRKTA
4 149-154 GAAVGF
5 156-161 GLMSSS
6 174-179 GIIFAC
7 180-185 GCSFAY
8 201-206 GLVNGI
9 230-235 GLFYTL
10 265-270 GGSGSS
11 300-305 GIPLAL
12 337-342 GVTSGV
13 384-389 GALIAV
14 394-399 GLFDGC
15 464-469 GAVLCF

PDOC00013 PS00013 PROKAR_LIPOPROTEIN
Prokaryotic membrane lipoprotein lipid attachment site
169-179 LYLTYGIIFAC

PDOC00029 PS00029 LEUCINE_ZIPPER
Leucine zipper pattern
365-386 LSFFFIGLMSNMIPLCSIFGAL

PDOC00240 PS00267 TACHYKININ
Tachykinin family signature
Number of matches: 2
1 154-158 FVGLM
2 369-373 FIGLM

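The pattern matches listed above can be reproduced approximately with ordinary regular expressions. The sketch below is illustrative only and is not the tool that generated the listing. The regular expressions are hand translations of the PROSITE patterns as commonly published for PS00001, PS00005, PS00006 and PS00008, and should be checked against the PROSITE entries before reuse. `N_TERMINUS` is a hypothetical name for the first 20 residues of SEQ ID NO: 2, read with the scanned character "5" taken as serine (S), which is an assumption. Positions are reported 1-based and inclusive, as in the listing.

```python
import re

# Hand-written regex translations of four PROSITE patterns (assumed, not
# copied from any tool output):
#   PS00001  N-{P}-[ST]-{P}
#   PS00005  [ST]-x-[RK]
#   PS00006  [ST]-x(2)-[DE]
#   PS00008  G-{EDRKHPFYW}-x(2)-[STAGCN]-{P}
MOTIFS = {
    "ASN_GLYCOSYLATION": r"N[^P][ST][^P]",
    "PKC_PHOSPHO_SITE": r"[ST].[RK]",
    "CK2_PHOSPHO_SITE": r"[ST].{2}[DE]",
    "MYRISTYL": r"G[^EDRKHPFYW].{2}[STAGCN][^P]",
}


def scan(sequence, pattern):
    """Return (start, end, text) for every match, 1-based and inclusive.

    The pattern is wrapped in a lookahead so that overlapping matches
    are reported as well.
    """
    hits = []
    for m in re.finditer(f"(?=({pattern}))", sequence):
        text = m.group(1)
        start = m.start() + 1
        hits.append((start, start + len(text) - 1, text))
    return hits


# First 20 residues of SEQ ID NO: 2, with the scanned "5" read as "S"
# (an assumption about the damaged text).
N_TERMINUS = "MVLSQEEPDSARGTSEAQPL"

if __name__ == "__main__":
    for name, pattern in MOTIFS.items():
        for start, end, text in scan(N_TERMINUS, pattern):
            print(f"{name}\t{start}-{end}\t{text}")
```

On this 20-residue stretch the scan finds the casein kinase II site at 4-7, the protein kinase C site at 10-12 and the N-myristoylation site at 13-18, in agreement with the first entry of each corresponding list.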


Membrane spanning structure and domains:

Helix  Begin  End  Score  Certainty
1      67     87   1.683  Certain
2      117    137  2.220  Certain
3      146    166  1.923  Certain
4      169    189  1.494  Certain
5      202    222  1.761  Certain
6      232    252  1.893  Certain
7      291    311  1.874  Certain
8      330    350  0.801  Putative
9      364    384  2.458  Certain
10     387    407  1.707  Certain
11     419    439  1.966  Certain
12     453    473  1.826  Certain



BLAST Alignment to Top Hit:

>CRA| 62000057354769 /alticfcgi 114090278 /defedbj | BAB55595.lt CAB047324) 
TAT1 [Rattus norvegicus] /org=Rattus norvegicus
/taxon=10116 /di\M*0D /dataset=nraa /length=514
Length = 514 

Score = 874 bits (2233), Expect = 0.0
Identities = 435/517 (84%), Positives = 463/517 (89%), Gaps = 1/517 (0%)
Frame = +1 

Query: 232 WLSQEEPDSA-RGTSEAQPLGPAPK^^ 408 

MV S EEP +A R T+EAQP GPAPf AP P PGPSD + +VEKVEVEL + 
Sbjct: 1 Mtt\JEEPAAAERElUEAQPPGP 58 

Query: 409 EPrePPEGGAiG^NMJW^^ 588 

EPPEPreGGAiG^VMJ\AM^^ GVLFVSMLETRrfKDDD M FX AAWVG 

Sbjct: 59 EPPEPPEGGWG^VN^^ U7 

Query: 589 SL^MC*1IFF<3CPIV5VF^ 768 

SLS^OIIFTCCPIVSVFn^^ TVGf+FAC 
Sbjct: 118 SLSMO^FFCCPIVSN^^ 177 

Query: 769 gcsfayqpslvilcjmtckrlglvngivt^^ 948 

QCSFAYQPSLVILGHYFKKRLGLVNCIEVTAGSSVFTTLIJ'^L L +VGL YTLR+LC 
Sbjct: 178 QCSFAYQPSLVILCJf^FKKRLGLVNGIVT^ 237 

Query: 949 IR^FVLJT-AGFTYRPLATSTKDKESGGS FKVTAYAVWAV 1128 

IFMFVLFLAGFTVRPL S+K-+KES S SS F5R+K SPPKKIFNFAf FK TAYAVWA 
Sbjct: 238 IFMFVLFLAGFTmPLVPSSK^^ 297 

Query: 1129 giplalpgyfvpvvhlmchwerj^ 1308 

GIPLALFGYFVPWHLM HV ERF+O NKEV+ MaGVT5GVGRLLFGRIADY+P(3VKK 
Sbjct: 298 GIPUVLR?rt=VPVVHLM^^^ 357 

Query: 1309 vylcmsfffiguvbmvepl 1488 

VYLCM-SFFFIO. SM^PLCSfFGAU^VfCXIMGLF^^ CP SQ 
Sbjct: 358 mQ^FFFI<iTSN|VttPL<^^ 417

Query: 1489 aigfllgr^ipWTVG^ 1668 

AIGFU.<3^IPMTVGP**WVGLL DKLGSYD+AFYLAGfPP IGGAVLC IPWIHSKKQR 

Sbjct: 418 AIGFLLGF^IPMTVGPfVAGLLHX 477 

Query: 1669 EISKTrGKEW^KNO-EN^ 1779 

ELSK TC EKMEKML NQfSLLSSSSGf FKKESDSH 
Sbjct: 478 EISKNraGEKJ^fcK^^ 514 (SEQ ID NO: 4)
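
The summary percentages of the alignment can be checked from the raw counts. The short sketch below assumes the percentages are whole numbers obtained by truncation, an assumption chosen because it is consistent with 463/517 being reported as 89 rather than 90. The names `blast_percent`, `ALIGNED_LENGTH` and `SUMMARY` are hypothetical and do not come from any BLAST implementation.

```python
def blast_percent(matches: int, aligned_length: int) -> int:
    """Whole-number percentage, truncated (assumed convention)."""
    return (100 * matches) // aligned_length


# Counts taken from the alignment summary (435, 463 and 1 out of 517).
ALIGNED_LENGTH = 517
SUMMARY = {"Identities": 435, "Positives": 463, "Gaps": 1}

if __name__ == "__main__":
    for label, count in SUMMARY.items():
        pct = blast_percent(count, ALIGNED_LENGTH)
        print(f"{label} = {count}/{ALIGNED_LENGTH} ({pct}%)")
```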

HMMER search results (Pfam):

Model    Description                           Score  E-value  N
PF00664  ABC transporter transmembrane region  4.2    7.1      1
PF01027  Uncharacterized protein family        3.4    7.6      1
PF00083  Sugar (and other) transporter         2.0    7.4      1

Parsed for domains:

Model    Domain  seq-f  seq-t     hmm-f  hmm-t     score  E-value
PF00083  1/1     11     50        184    223       2.0    7.4
PF01027  1/1     121    177       1      59 [.     3.4    7.6
PF00664  1/1     165    232 ..    1      76 [.     4.2    7.1
1 AATCGGTATT TTCTAGACTG TCCTTGATTC GGAI I TGICT AAIGI 1 1 I IC
51 7GATCGTTAG GCAGGGATTA TOQSTTTTO6 CAGGAAGAGC ACAGAGGTAA 
101 AGTAGCATTT TCATCAGATC ATATGGGGGA TAGATTATCA TCTAGTTGAG 
151 GTACTGrGTG CCATTTTTTC CAGCCTAAAG TTATTTCTTC COGOCACTOC 
201 OC CI I I ClLA T COATACTCT TRGGAAGAAA GTTACTAOGC ATAGGCACAC 
251 TTAAAGAGTA AACCATTCTA CTTCAGCTOC ATGAGGGAGG GAGTATGI IC 
301 ATAAAGTATT TACATTTCCT GCAGGAGAGA TTTCTCTATT CTCTCCTCAT 
351 TATTTATTTA ATCATTTACT TACATCAGTA CPGACTCGIG GATAATIC I I 
401 ACATA1GTGT TTGTTTGTGT GCATGCAAAT ATATAATCGA IGIGLI I ICI 
451 TTGGGGAATA ATATGTTGTG GAOVACTTTC AAAGTCAATA AATACAGATG 
501 ACCTTCAGAA CTTTTAGAGG TTTTAAAGTA AGTATCTAAT CAGICI ICIA 
551 GCAATGTACA TTATACTTCC AAATT TTOCT TATTTOCAAC AATA CTGGGG 
601 TATCATCmC ATACATACAT I I I IGIGGAC TTATCTGCCT ATTCLI I IGI 
651 TTACTATTTT ACGCTCATTT CTAAGGCAGA TTACACTTCA GCT ATCTTO C 
701 OCATTCCACA AOCAAGGAGG GCTTGCTTTC C7TAGTTTAT GCTATTTTCT 
751 CTAGCTGGAA TCCTCTTTTC CTGTCTTCAC GCACTGAAGT GGTATCTATC 
801 AATOVGQQCT TCGQOCAAAG GCTGTTTOCT GTAGA QQCTC TTCTACAGTG 
851 TTTOGAGAGA ATTTAAQQGA CTATCPCATC ICICICM J I GTACATCTAT 
901 TATAACACAT CATOGAGCC TGCTAGTCTC TCCCAGGACT CI I M I ICCT 
951 AOCAGTTTAT CAACTGATAA GAGQCAGAAA CGAGATCAAT OGCACTGATC 
1001 TCTGTACPCT ATCACAGTGG TOQQCAGATC AAGTAACTAG CATATTTTGA 
1051 CTTFGATTGA AGTOAAGAAT ACGAATAACA GAAATTAAGA AGCATCCTCA 
1101 ATATTGCATA GCAGGTTACT C I ICI I I ICI TT TACATA GG ATGGCACTGC 
1151 ATCCTTCAGG GAGACAGAGG AGTTGAATAC AGG1 1 1 IAGT TTTTGTTTAA 
1201 AGTGAAAAGG ACTOGATGT AGTPGAAAAG TAATCCTTTC TAGCTGTCTG 
1251 TTAAAAATCT ITCTTCGTTC AAGACTTQGG AATTOCAGTC CAGTGAGGAC 
1301 TCAAAATAAG C AICI I IG GT GTGCCAAATA TTCATAAGGA AATTGTATAC 
1351 GAATGCAAGA GAATGGAACT GAAGTAATAA AATAAGGGCT CTGATC3CTTC 
1401 AGA7GACTTA TTTAAGWGC CAGGTGGCAT AAOGAATCTT ACATATTATA 
1451 ATTAGTACTG AGAGGFGAAT GCCAAAAGAT AAAACAAACA CAATCGAGAC 
1501 AATGTTAGTG TCACTGTCAC GCTGTGTOGG TCAGTTCAGG CTAA0GATCC 
1551 AGTGTX3GCTC TOCTGAAGGC GCAGGGCGCC CGCAOCTAGG AGACGCGOCC 
1601 C1IC1GUCA TGCTTTGAGG GGGGGTGACC CACACATCTG TCGOCCTCTC 
1651 TCAGCAGGAG GAGGOGOGGT CGCAGAOG0G GGGGCAGACA GCGTCPGCQG 
1701 OGGGCACCTG GGGO0GOGOG GOGCGGGGOG GCCX3G0CT0C GCTCTOGGAG 
1751 GGCCAATCAT CTGGAGGCTG TOGGGGCAOG TXZCXjGCPCGC GGCCAOGCCC 
1801 CCAGGGGGGG GGGOGGGGGG TCCTTTTAAG AAGOGGOGGC TCGCAGTGGG 
1351 CTCAGTCGGG GGTOQGGGGC TGTGAOCTAG AGQCTTCAGT GTOGATCCCC 
1901 GAG GTC! ICG GGCGCGOCAG CTOTCCTOGC GGGCGGCTGC GCGCTGGCCG 
1951 OCTGOGOGCT GCCAGCCCGC CCGOCCGGCA GGGGCTQOGC CGOOCTOGCC 
2001 TCGGGCTOGT TAGOCGGCCA GGAGCCCOGC AGCTCCTOOG GGAGCCCGCT 
2051 GGTAACTOGC GTCOCTOGOG C I I CICO GGC GOCTGAGGGG CGGGCCPGGG 
2101 GOCATOGTGC TCTOOCAGGA GGAGCCGGAC TCOGCGOGGG GCACGAGGGA 
2151 GGCGCAGCCG CTXjGGCOCGG CGCCCACGGG GGCOGCTCCG GOGCCGGGGC 
2201 CGGGACGCTC GGAGAGOOGC GAGGOGGCTG TCGAGAAGGT GGAGGTGGAG 
2251 CTGGCGGGGC GGGCGAGOGC GGAGGOCCAT GAGGCCCCOG AACCCGCGGA 
2301 GGGGGGCTCG GGCTGGCTGG 7GATGCTGGC GGCCATCTOG TGCAAOGGGT 
2351 OGGTGTTOGG GATOCAGAAC GCT7GCGGGG TCCTCI IU>I GTXXA7GCTG 
2401 GAAACOTCG GCTOCAAAGA GGATGACAAG ATGGTCTTTA AGACAGGTGA 
2451 GGCGOGGCGC GOGGQGAGGC CAGCCTGGGC GACCOGCGTG GGGCGOOQGA 
2501 GCGCATCGOG OGTGTGGGCT GIGrCTGCGT GCGAGTGTGC ATGTGGGTGG 
2551 GTOCCTCTCC CAGAGGGTGC GAGCAGGGGG GIU I ICGAG TTGCAGACAG 
2601 AGGCTQGOGC TTOGGGGOC TOGGGGTP90C (Jb ICI II A TA TGGAATCCAG 
2651 CTGCAGAGCT GTGTGTTTGC AAGOAGGTCG CAGAACTTAC TTCGOGAGAT 
2701 OGTOCT0C1 I TCCGCTCAGC AGAGCAGACG CTAACAOTOC ACAGGAGGCC 
2751 I ICCI 1 1 1 A T TGTTTGAAAA GAAACA GAAC CCCAGAAOCT TCAAOOOCAG 
2801 TCATCGGOCT GTCATTTTTC TGGTOC1 1 1 GGTGACTATG CCAGTTATGT 
2851 A G II CHC AC CTGOCTCGCT GGGOOGCAGA GGGGTGTGOG TATGTTGGGG 
2901 GGGOGGGGGG TOGAGTTTOG AGGAATCAAA GAGA 1 1 IG IA OGAAGGTCAC 
2951 TGGAGTTCCA AAGGGGGOGC TGCAAGAGTC ACGGTTCGGT GCGTTGCGGT 
3001 (XCXDGOGOCr 1 1 1 I I I U XC tctgggttaa ATGTAGAAAA CAOGGGAGGC 
3051 AGOOGGATTA GGGACTAGGA TGAGGAAGGT GAAGGGTPGC I ICI ICGCTC 
3101 ITCCTCT TC1 GTTTTTTGAC AM I I 1 1 1 1 1 AACCATATAG TAAATTAGAT 
3151 ACAAAAGGTG CAGATTCAGC Gl 1 1 ICICQC TGTAGAGCAT TATTATGACT 
3201 1 1 1 IG GCTGG TTAGGCAAAA AACAAATCTA AGACCI ICIG CA7GACACTT 
3251 TAAGATAAAT I CI J IC ACTT TATCCTGCAA GGTGAGOGOG GTCAAGCCGA 
3301 TTTGGGTGAG AAAACTGTAG CTCAGTGAAA GTGTOTGGT GGGTAGTAGA 
3351 ATOGCAATAA AACACATATC AACTGACTTC AAGGGCTAAG TGATTTOCAT 
3401 TACTAAATCA ACCTOCCTCC CCATCATTTC GQGTAACTTT ATATGATTAA
3451 TAG ICI I 1 1 1 TTTTAACCTT GATTT7CTAT TATTTTTAGA GTGAATATTT 
3501 CTTAGGTCTT TAGTATGGAT ATGAGGAATC GGCAAGACTG TAATAAATTC 
3551 TGAGACAAAG OTAATQCTQG GnATOCTGA GAGTTTTAAA ACCTGACATA 
3601 AATACTATTA AACTAI I IGI ATCATTCTQC AAOTACnT TCTTOCATTC 
3651 CGCATCATGT TTCTGACTTA TOCACATAAT ACCTCACTOT GAACTGATAA 
3701 CTCAAATTCT TTCCATTTTA AOTAGOTQG TTTGCATTGT TTGACTATAT 
3751 TATACTOAT GCATTCTCCC TCTGATCGGC ATTTAGATTC CTTOCAAACr 
3801 CATTCTAAAC AATCCT9CAA TGAATATTCT TCTACAOCT CGTTATGCAT 
3851 GTGAATAGGG TAGCATTTTA AGCTOGAATT TCTCTTC1 1 1 AAATAGCTAT 
3901 TGAAACTGCT GTOGTATOCG QOTCAATOQG CTAGGTAGAA AAAGTGTTAA 
3951 AAATCTAGTA ACATAICJLI I ACCATTTAAG GGAAGTAATC ATTCTAAAGT 
4001 TTAQCAQQQG AGATATQCAT ATATAATAGC AAACAAAAAT AGTTTCTTCT 
4051 CI I 1 1 CI ATA TCAGTATTOG GTGTCAGAGA GAAAAGCCGC AAAAGAAGGC 
4101 AGAATTGACA GAGTTAAGAT TTAAAGACTA GTTCGAACAT TTAOCATATT 
4151 CCTGCCTGGG ATTACAGATT TTTTAATCCA GKAAGATAA GAGCAGTCTT 
4201 I IGI I IATCA TTGTTTTTCC AAATTCAGTT AAGTAGATCC TTTGGTOT CT 
4251 GTOQGTOQOT 1 i 1 1 1 1 1 1 I I 1 1 1 1 1 II I 1 1 1 I I 1 1 J I 1 1 1 MINI I GAG 
4301 AGAGAGAGAG AGCAATTCCC AGAGAGAOCA TAGCTTTGOC AGGGATGAGA 
4351 ATTTTOCAGr GFCAAAGTCT CTACCTACTA CTTCTCOXA AAGTTCTAAT 
4401 TGGCTACACA ATATOXAAT ACTGGGTAGC TGAGAGTGAG GGAAGGAAOC 
4451 TC G1 1 1 MC I TTTGCACTCT GTGGAACTTT GIOT1 1 1 C C A 1 1 I PGATGAA 
4501 TATCTTTTTT Clll I IACTC AGTTOVGTCT TTGACAACTT TTTCAGTCAT 
4551 GTTTCTGTAT GIGTGGCTAT ATATCATATA AAGAGTRGCA CAGGTGTGCT 
4601 AGTTAAATGT GTGAAGATCT I IGIGI I ICI CTGOCTGACT GCTGTATATC 
4651 TATTTATGGT TCTGOCATTG CACAAGGGTG CGCAACTCAG GGGTAAGTGG 
4701 GGACTGAAAA OCAGCCTGGG CTCTGGGTGC CLI I IGICIG ATTOCTAGAG 
4751 AAGGGG0CTA TCCACTTCTA ATGGCGCTCT ATAACACAGC ATCTAGATTG 
4801 AACAATGGCC ATTACTTGGG TCCTAGGTAA TACATATATC ACTGATAGAT 
4851 GTTAAGGGCT GAGGAAGAAA ACAGATTTAA ACTTAGTGCT GAAAAAATGG 
4901 TTAGAAGATA GTCTTTAAGC CAG7TATTGT TGAGATC7CT CI C I C I ICCC 
4951 CTGTOGC3CTA GCGLI I ICIC I I ICC I ICAG TGCACACACA CACACACAAA 
5001 GWGI 1 1 CAT GAAGTOOGPC ATCTAGCAGA GTCACTGTTA TTTCAGAATA 
5051 ICIGU I IGA Abl I IGATTG GTCAGACTTT TTCACI I IGA TATTCGAATG 
5101 CTGAGTOGTC TCTGATCAAG CATATGCAAG CTTCAAATAC ATGGCAAAAA 
5151 ATATCTGGAA TTTGTTTAAG GCTTTTATTT TTCAAAGTTT TGGTCTATTT 
5201 TCTATTAGOG TACTCATGAT GGATAATCCT GGTGTTAGAG TAGAGCTAGT 
5251 TCTCTCnOCT I GU I G CATT AC I ICI I I AT AGCAAGTGAC TAGCCTAAGG 
5301 ATATACAGGG AGGT G GT G GT GGAATGGAAT CTAGGTCPOC AAATGATGGT 
5351 GOGCAI I ICI TGAGTACTTT GCTGTGGCTA AGCACTTTAG A7GCGTTCCT 
5401 ATTTAAACCT TAGCACGATT CTOGATAGA CI I IGI IAAT AICI I ICI II 
5451 TCAGATATGG GAACTCAGGC TTAGAGAGTT TAAGTAAGAA GTGGAGOCAG 
5501 AATTCAACCC CAGGCTTATC TCACTCTAAG AGCTOGGATT TTTATTTTAA 
5551 TTATTTATTT ATTTAAAATA 7GGAA7GCTT CA7GAATTTC GATCTCATCC 
5601 I IGI ICAGGG GTCAOGCTAA ICI ICICIGI GTCATTCGAA TTTTAGFAGA 
5651 I IGI IGI ICA AAGIGCIGCT GAAGCAAGCA CCAGGAGCTG GGTTTTAATC 
5701 ATTCATCATA TTGCATTGAC TAGATAACAT TCTGCAAATA CGAIGI 1 1 I I 
5751 TAIGI IGI IG ATTAATTTAA GIG I IAGTGA ITGG1 I G AGT GCTCTAGCAT 
5801 GCATTCTOQG ATTAGAAAGA AGGGPCOCTG TTTCTTGGTC CTACI I IGIG 
5851 GTGAATAAAC AATTCCAAAT TATTAATGTC TCAAACTATA 1 1 ICIGAAGT 
5901 GTAGAGAGAC TTCCATAGAA GAACAAGATA CTTCCATATG G0G7TCAAGC 
5951 AAAAGTOGG G G1 1 I CC 1 1 I GAAGAAOTT TAGATTGATC CACAGCAGGA 
6001 CAAIGI I ICI AGGCAGAACT GAGGAGGAGC CI I ICI IAGG CTCACTTCPC 
6051 TTCAGGGCTC TCTTAACTCT TCCCACGCAA TOGATAATCT ACGCAAAATT 
6101 TCTCAGGAAA GGGGC7GAAG AAGTTCATTC ACACTAAGGT GTAAGTGACT 
6151 TTACACATCT TACPGTTAAT TOO I I A TA CAAATCTTTA CCAAGTTATC 
6201 TAACACGCTT IGi 1 1 IGGGC ICIGICCIGG GGACTOGAGA TAATCACTGA 
6251 GAGAGAAAAT GTCAGCIGI I TCAAAGTAGC TTAGGATCTG TTGTGGGATA 
6301 CAAATTAATA ACAGAOCAGA AGTAATAGAA TATTTOOCTG AAGGAI 1 1 IC 
6351 AATATAAGAG GACTCAGITT TACTATAAAA GGCTGAAATT CTAAGGTCAT 
6401 TTCAACAGGT GGTOGGGTTG GGGGTGGGGA AGGCATTTGA GGCCTCTTTC 
6451 TCTATGGTTA TAAATCTCAC TTGGTGAAAT TAAGACTTTG GAAAGGGGAA 
6501 GTAAGGCAAC TGCAAGTTGG GCAGTAGAAC CAATGAAAAA TGCTGAGGGC 
6551 ATCACAGTCC CATTATOGTG COCAGCTG0C AATCACATGG CACTCAGAGG 
6601 AGIGICICAC ACATACTGCT CIGICIGAGG GAGCAAGCTA AGCTTCAGTT 
6651 GTCTC7TTTT I IGI IGI 1 1 1 1 1 1 1 1 1 1 1 1 I TTTTTTT7GA GAGAGATTCT 
6701 GACTCTOTOG GCCAGGCTGG AGTGCAGTGG CAGCATCTCG GCTCACTGGA 
6751 AGCACTGCCT GCGGGATCCA AGCAATTCTG CCTCAGCCTT OCGAGTAGCT 
6801 GGACTAGCTG GQCTTQOCAC CACAGCTGGC TAATTTTTCT ANN I ACTA
6851 GAGACAGGGT TTCAOCATAT TCGOCAGGCT QGTCPCAAAC TCCTCAGCTC 
6901 GTGATOCAGC TQCCTOQGCT TCGCAAAGTC CPGGG ATTAC AQGCATAAGC 
6951 OXOOQCGCCT GGOCAAGTTC TCIC1 M ITA GTTGAATTTT TACCTCTTCA 
7001 CATCTOTATT CI ICI IG GCT AGGTAGAGAG GAATCAGAGA CTCTGGGGAA 
7051 GAATACAAAG AAATACAATT AAGTGGAACA I !OI I I ITCT TTAGAAAGTG 
7101 CAATTTTOQG CTOGGCGCAG TOQCTCATQC CTCTAATCOC AQQXTTTOG 
7151 GAGGGCAAGG CAGGTOGATC ACCTGAGGTC AGGAGTTTGA GAGCAGOCTC 
7201 GGCAAGATCG TCAAAG000G I I ICIACTAA AAATACAAAA AATTAGCTOG 
7251 GCATQGTOGC GGATCOGTCT AATCCCAGCT ATTCGGGAGG CTGAGGCAGG 
7301 AGA A 1 P5G 1 GAA0CTOGGA GGCAGAGGTT GTAGTGAGOC AAGATGGCGC 
7351 CACTCTACTC CAGGATOQGC AAOVAGAGTC AAACTCCGTC TCAA AA AA AA 
7401 AAAAAAGAAA A QWWWV G AAAAAA G AAA GAGCAACTTT Gl I I IAACTC 
7451 TCCTAGATAC TGGAAAAGX ATGGAACTAA TCAAGAGGCT AGGGCTTTTT 
7501 ATTTCTTTTC AGATTCTOGC ATTTCACTCC AGOCTGGGCA ACAAGAGAGA 
7551 AACI I I G T CT CACACACAAA AAAAGTCTAA ATCAAAACAT TAAAAATTAA 
7601 GTAGTTTCGA AGTAGATTAT CAAAAAGGTC CPGAAAGGGA GGI ICI I IGG 
7651 CTATAATOT TAAOGCAACT CTACACTGGC TCTATGGAGA CAGAI I ICI I 
7701 TTTAGATGGT TACAOTCACA AAGTAGGGTT TTCAGTAGCA TTTAGGGATG 
7751 AATCAATCTT GCAGCAOCTC TCCATCTATC TTCCTAGCGC CTOG AAACT 
7801 TCAGGTCAGT TAG I GL I ICC TCAGAAATTC TTXXGGCGAC ACCAAGI I I I 
7851 CACATTTACA GTTATACTGA TATCCACATT CTACIGI IGI ATCTCACAOC 
7901 TAGATTATAG GAAATTTTOG CTATAOTTCA GAAATTAACT GCTAIGI I 1 1 
7951 GOCTTTAOGC TAAAGAGATT I IGI 1 1 IGI I TAGT7K3GAAA AGOGG OCTCC 
8001 ATAACTAGOC A l l I C I GIA T CTTAGAAAAA 1 1 1 1 IACTAA CAGICLII IG 
8051 TTGAGCTAGT TACAGTGAAC AAATAATCTG GTTCATOGTC CTATACATCT 
8101 TTCACTATAA GAAAAATAGC TGAI IGI I AT TTACACTOGA AGAGAGGTAG 
8151 AAAAGCTAAG AGAACTCACT TATCGCAATA AAGCAATCTA AACTACjCTGC 
8201 TAAAATAAOT GAGAAGATTA TAAAAATGGT TCTAGGATTT TGGAATAATA 
8251 GTGAGTATCG TATGGGGGTT TCATACTTCA TTTCCAGAAT Gl I ICIGGAT 
8301 TAAGTG0GAG ACTGAATAGC ATATATAGTG AATTOAATT AAATACAACA 
8351 ATCTCAGA7T C3CTGTX3GTCT TTTTTCATQG AATTAAAAAT TAATAATTTC 
8401 AATAAAATTA ACTOCTCAAA GAACCCAATT AGGCAAAATG AAAAGCATAA 
8451 CAC AI Mill OVGGAGGGAT TTTCAGGTCT CI 1 I TAGAAT AAATTGTACT 
8501 CTGCTTTTC A TGTGATTTGC TACATCTTTT IGI IGCAGTT CCTTCAGGCT 
8551 CAGCO0CTQG OCATATACTT GCTTCACTTT T0LIGCM IC TTCCATCCAC 
8601 I G I C I I G GGG CIGI IATTTC GAAATCTCAT CACIGIGI IC AAGACTTATT 
8651 TACTAI I ICI GGACAGTTCC ATTTOGGTAC ACAGGTACAT CATACTAAAC 
8701 TAACATGAAC TCATTTTTCA GCTACACCAT ACAA0CTT0C CTTACTAOCA 
8751 AAAATCACAG G0CAI I IGIC GGCAI ICI IC TGAATCCATA TTOLICCI IC 
8801 TTAAI IC I CI " GTGCATGATA OCICIGGI IG TTTAAGTCAG AAAGCTOGAA 
8851 TCTATCTTAG CI ICI IATTT CICI ICC I IC CTTACCIGI I I IGI IAAATC 
8901 AGA GI IC I I I I G I GI I I A OC I ICI IAATCC CTCATAAATC AGTCQCAOT 
8951 I I CU I CA CT ACIGIGICI I AATTCATCCC TTCATTTACT TTTTATTTTA 
9001 1 1 1 1 I I G GGA TAGGGCCTCA CIGIGI ICCC CAGGCTGGAG TCCAGTGGCA 
9051 TCATCATAGC TCACTGCAGC CTCAAACTTC TAAGCTCAAG CAATCCTCCT 
9101 ATCTCAGCTT TTTCAGTAGC TCCAACAOCA GGCACATCGC ACCATGGCGG 
9151 GCTAATTAAA AAAAATTTTT ATGTGGAGAT GGCAI CI IGC TAI 1 1 IGLIC 
9201 AGQC1 I GICI TCAACrOCTO GCCTCAAGOG AGLCICCIGC CTCAACmOC 
9251 CAAAGIGI IG GGACTACAGG C3GGGACCTAC TCTGCCTGGC CACCTTCATT 
9301 ACTATTOGCA AGAATTAGTC ATAACCCCTT AACAGGATTC CI IGICCTCA 
9351 Gl IG TA CATC 7GAGTGATTT TTCTAAAAGA TTOGAOCATA TCAM I I ICI 
9401 GTTTAAATOC CCAGPGACAC TCATTACTTT TAGGAAAATC TCAAACTGGC 
9451 TACTCCGAAG GOCTCCAAGC TCTOGCGCTT G0CTOG0GCT CTAGCCTPGC 
9501 CO CreCl I CI CICCCI I ACT GG I C I I IG I G TTCTAGGCAA OC TCTAQG TG 
9551 TTACACTGGC OCAAAI I IGT CI IGC IGC 1 1 TTTCOCTOG TACCI I IGIG 
9601 TCTOCCACTC CTCTOTCAG TOCGATGGTT GGICCI IGIG AGATTCTGAT 
9651 GAAATOGTTC GOCATTTTAT TOTATCTCA CAATOCTOGG ACACAAACAG 
9701 TCATTTTATG CAAI IGI I AT CTATTTGATC CACTTOGAAT ATTGGGGGTA 
9751 GCTACATTTT GGAGI I I IGA GAA0GAATTC AAATAAOTTA CAAATTATCT 
9801 TTAAAGTGGT AGACAGAGAA GCTGATTTCA AOCTATTCTA ATAAAGCATT 
9851 GCGTGAAAGC CATTTTAAAG ATGATCCATA 1 1 IGI IAAAG TCGTAATTTT 
9901 TAT A I I C I CI GATATGGTTT GGLIGIGI IC OCAOCCAAAT CTCATCTTCA 
9951 ATTCTAGCTC GCATAATOCA CACI IGI GAT GGGAGGGAGG TAATTACCTG 
10001 GTOGGAGGTA ATTGAATCAT GGGGGCAGGT 1 1 ICCIGIG C IGI ICI IGIG 
10051 ATAGTTAATA AGTCTCATCA GATCPGATCG TTTATAAAGG GCAGTTO0CC 
10101 TGGACACnT CTGI IGOCTG TTGOCATCTA AGACATCOCT TTOCTCCTCT 
10151 TTCAC3CTTCC AGCATCATTG TCAGGOCTCC CTAGOCATGT GGAACTCTGA 
10201 GTCCATTAAA GCTCT 1 1 1 IC TTTATAAATT ACXjCAGGCTC AGATAI I ICI
10251 TCATAACAGT ATCAAAATGG AGTAATACAT TOCATTACCA TAAAGAAAAG 
10301 G UI ICA TCT AGATTATTTT TTAGAGTAGC C I IGIG GTAT GTCATTAOCT 
10351 GCATGGATAG ATAAGAAAGT TQCAACTTOC ACAGTATTAG GATTCATATC 
10401 AGTATTTACT TTTATTAAGT TCAACTTAAG AGCAGCTTTT TGGCTGGAAA 
10451 AAAGTTCTAC TTATGTCAAA GI IGIOUG A AAGTAGAATC CTACTOCIGI' 
10501 CCGCAGGCTG AAACTATTTA CTACATATTT ACTTCCATCT TCTTTAGAAT 
10551 ATTCTCTCAA TAGTOTCTGC TACTCAAGTC ATCAGAAATG CTCTGATCTC 
10601 ATTTTTGGAA AAGAATTOCA GAGTTATCAC GGOTAGCTA GAAATCTGGT 
10651 CTTATATTCA AATTAAAGAA GCAAAOCTTA ACAAAAACAA GCTAAAO0CT 
10701 AAACATAACA TCAACAGTCA GCTCAGCACA 0 1 1LIGAGCA OCTQOCTTCG 
10751 CCTGOTGOCA GOCACGCAGG GACPGTGGAT GI1H IA TTC GCAGAGATTC 
10801 AGAACAGGAA ACTOCAQCAC AOCTGGGAAC TGOGCAGAOC CACCACATAA 
10851 GACAGATAGC CTATCAGTGG CTCGAGGAAT GGAGGAAAGC AGTGCTTTCA 
10901 AATCTACATC CCAAATGTGI" ATGATCATAC CTCTTTCTTA AAGTCGCTTC 
10951 TTTAAGAGCA AAAGTAATTC CTOVOCTTOC ATATAGGAAC TAA AAAA AAG 
11001 TGGATCAAGA AATGGCTTGC CTTAI I I I LA AGTAAGAAGT CM I II I CAT 
11051 TTCACTAATT TTTAATTATG GGCATAAGTA TGAAATACAG ATTAGAAATA 
11101 CTGAATCTOG AOCAAAGCAA IGM ICCI II G7GGAOCAAA GCAGTGAATC 
11151 UN I IC I II C I I I C I I I C I I I (CI I I I II I IGI 1AAGAG ACAGGGTCTT 
11201 GCT CIC I IGC TCAGACTGGA GTGCAGTOVG TGATGTGATG GCTCACCATA 
11251 ACCTCAAGCT CTTOGQCTCA AGGGATCCTC CTCOCOCAAC CICCIGAACA 
11301 GCTCGGATTA CAGGCACATA CCAGCACACC TGGCTAATTT TTAAAAATTT 
11351 TTTTGTGGGG GAGGGTOCT CTATATTOOC CAAGUGGI I TCAAATGCCT 
11401 GAGCTCAAGT GATOCTOGAA GCTCAGGCTC GCAGAGTGTT GGGATTACAG 
11451 GTTTC AGGGA CTCTGGCTGG GCCCAAAACA GIGI I IGIAA CTCCCI I IGI 
11501 OOCCICI IGA TAAACATAAA AGTCTCATGG TACTTCAGAA I I I IGI IGCA 
11551 CAGATOCAAG TCTAGTTTGC CI I HOG IGI GAGTGGCAGA AGACAATCTC 
11601 ATACTCTOTA TTTATCCATC AGCCAAAATT TTCTCAAGCT TTACI I 1 1 AT 
11651 MM I A AATT TTTAI I IGIA II 1 1 II I I IG GAGACAGAGT CI IGCICIGI 
11701 GGOOCAGGCT GGAGTGCAGT GGGGTGATCT CGGCTCACTG CAACCTGCGA 
11751 TGCCraGGTT CAAGCGATTT TOGTCTOjOV GCXTOCAGAG CAGCTGGGAC 
11801 TACAGGCAOG CAGCACCA7G GCTGGCTAAT Mill IGI AT TTTTAGTAGA 
11851 GGTAGGGTTT OQCXATCTTG GTCAGGCTGG TCTGGAACTC CTAATCTCAG 
11901 GTGACOCAOC OQGCTCAGTC TOGCAAAGTG CTGGGATTAC AGGCATGAGC 
11951 CAOCATGCCC GGCCAAGCTT TACI 1 1 IAGT TATGI IGIGG ATATAATTAG 
12001 AATTAI I I I C I IGI ICI I IA AATACAGAAT TATAATATTG AATGTGTGCT 
12051 TTAAAAAAAT TAGTAAATCT ACCCCTCTAG AACI I CI GAT CTATGOCTCT 
12101 TAGTGAGTGA GGGGAGCTCT GCCGCTTTCC TGCTCGAGTG CCTCCACTTA 
12151 AGAATCACTC ACTAAGGAGT TTTAAATTCA ATTAAAGGTA TOCTTTAGAT 
12201 AGYTAACTCT AAATCAAAGG TCAATGATTA AATTAATGGA TAAAAGTCCA 
12251 TTGCAGCTAC GGAAAGGTGC AT7GGTOAC AGTTCAQCTA GrACCTATTT 
12301 TTTGCTAAAG ATTAAATTGC AQCTGGGGGG GGTOGCTCAT ACCTGTAATC 
12351 OCAGCACTTT GAAAAGCTCA TGGAGGGOGG GTGOOGTGGC TOVTGOCTGT 
12401 AAATCTTAGC ACTTTOGGAG GCCGAGGTGG GCGGATCACT TCAGGTOOGG 
12451 AGTTAGAGAC GAGGCTGAQC AACATQGAGA AACCCIGICI CTACTAAAAA 
12501 TACTAAATTA GCOGGGCATG GTOGOGCATG CJCIGIAATCT CAGCTACTGG 
12551 GGAAGCTGAG GTAGGAGAAT GQCTTGAAOC GQOGAGTTGG AGGTTGCAAT 
12601 GAGGGAAGAT CATGOCATTG TACTGCAGCC TGGGCAACAA GAGTGAAACT 
12651 CAGTCTCAAA AAAAAAGAAA AOCXTAGGTG GAAGGATGGC TTCAGTC3CAG 
12701 GAGTPCAAGA ACAGGCTGGG CAACATAGTG AGACCCCATC TCTA CI I I IA 
12751 ATTAAAATAA AATTAAGATT AAAAAGTCTA GTAAAATTTT TAAAAAGAGT 
12801 AAATTCTATC AGCAGIGI IC TOGCIGIGM CAGAAAGCCA AAATTTATAT 
12851 I IGIGI I ICA TTTAGATTOG ATCCAAGAGC AATTTTGAGA IGIGI 1 1 IA A 
12901 TATTACAAAA ATAGAAAACT ACCIGI I CAT TAAATOGTCA TATTTATTCA 
12951 CMMC I GIG TAGATAAOTA TTAATGCCAA GTCAAGTAGG TTAGTCTGGA 
13001 GACIGI II I I ATTAATAAAA ACCI 1 1 1 CAT CTTAATTATC CM 1 1 IA1TA 
13051 GTTTTGACTT TATGTTOCAA CTTCAAAGCA GCATOCAAA GGGTTCACAT 
13101 TTCTC A GTOG I IGI I IAAAC AAGGAGATTC CAGAAAGTTA ATTC AIGI 1 1 
13151 TAGIGIGAAT GGGGCAGGAA Gl IGICATTC TIGICCIGIA GTAACGTAGC 
13201 ACTCAAACTT TGAGAGGAGG A00CCAGOCA TCTAGTTATT GACTGTAAGT 
13251 ACACTTCCCT TAAGAGTAAG ITGIIMIIA GGAATAAATG ATAAATOTAT 
13301 GGAGI 1 1 1 AG TTGTAGGTGT TAIGI 1 1 IGI CTC3C1 ICI I C TCAGAGAAAA 
13351 ATGTTACTCT GGAATAQGCT TAAACTGAAA AAAGGATCTG AM M I A ATA 
13401 AGATCAGATT CTTTGQGCAT Al M IGATAT TGCTTCAAAA CAAAIGI I IA 
13451 ATATCAGATG CACAATCTTA AGAGCTCTAT AATCTAGTOG TAACAGCTGA 
13501 GOCTCAGGCA GT&CTACAC ATTAAGI ICI I GI I CA TTTT TTGCAGGGGC 
13551 AGGGGAGCTG GTGAGGCATA AAGAAGGGTC AGAGGAGATA ATAGACTTAT 
13601 ATT7OTATTT TTATGCATAA TTATATAGGA AACATTAAAT GCAGAGTTCA
13651 TAAATAATTT OTATCTATCT ATTTATTTAT 1 1 1 PGASACT GQCTCCTCCT 
13701 CTGCTGCGCA GGCTGAAGGG TAGQQOTATC AACACAGCTC ACPGCAGCCT 
13751 TCACOGGGC TCAAGGGATT TTCTTOGCTC AGCCT0GGGA OTAGCTGOGA 
13801 GCACAGGCAT GTOOCACTAC ACCTQGCTAA TTTTTAAAGT 1 1 I I I IGIAG 
13851 AGATGGGGTC TCAOCAOGTT GOGCAGGTTC GTCTTGAACT CCTQQQCTCA 
13901 AGCAATCCTC CTX3GCTTCGC CTCGCAGAGT GTTCGGATTA TAGATCTGAG 
13951 CCACCATCTT GAGCTCATAA ATAAI I ICIA ATCTAAAAAT GCTATTTTCT 
14001 ATCGAGAGGG GAGGGCAAAT AGGCTATTTT TTOCACATTT TCTTGCTGGC 
14051 CAGAATCTXA GAGGG1 ITTT AOCTOCATTA AAAATGATTA AGGCTGGGCA 
141D1 GAATGGCTCA TOCCTGTAAT GOCATCACTT TGGGAGGCTC AGGCAGAIGA 
14151 ATTCCTTGAG CCCAGAAGTT CTAGAGCAGC CPGGGCAAGA TGGTCAAACC 
14201 OC AICIC I A C GAAAAATOCA AAAATTAGCT GGACGTCGTG GCAGGCAGCT 
14251 GTAGTOOCAG CTACTCAAAA GGCTOVGGTG GGAGGATCAC CTAGCTCTAG 
14301 AGGTCAAGGG 7GGAA7GAGC CAAGATTQCT CCACVQOOCG GCAGGCTGGG 
14351 CAGOSGAGCA AGAGGCTCTC TC AAAAAAA A AA AAA AAAA A GGTTATTTTT 
14401 TTTCAGCATA GACAAAGCAG GGGAAGGAAA AGATTATCTT TCAA ATCTTC 
14451 ATTTAAACTA CTACATTTAA AGTAATACTT CXTAATCATT TAAACTTTAG 
14501 ATTAGTCTAT TTATCGGTCA GCTOGAAGAT TCTTTATAAA ACATGAGAGT 
14551 TTATTACrPC TTTOVATACA OGGGTCTCTC TAAATGATGC TCAATAGATPC 
14601 TGAAGOCTGA ACTTTCTGAA GAAATC1 PS1 GAAATTATCT ATCGAT GTCT 
14651 ACTTGAATTT AAATAAAAAT AAATTCTAAT ACATTTOGTT TTATTGGTTT 
14701 TGAATTTCTA ATTATTTOGG GA7TTOGATT TOG1 I IATCT ACAGTTCTCT 
14751 TTTTTTTTCA GGTOGAGTCT QGLI I IGICA OCAGGCTGGA GTGCAGTGGT 
14801 GOGATCTOGG CTCACPGCAA TCTCTGCCTX: OOGTCTACAA GGGATTCTGC 
14851 TCCTTCAGCT TOCTGAGTAG CTAGGATTAC AGGCATCGAC CTGCATCOGC 
14901 AGCTAATTTT TCT A1 1 I I IA GTAGAGATOG GGTTTCACCA TGTTAGCCAG 
14951 GATGATCTOG ATATCTTCAC TICG TC ATOC GCCTGGGCCT ACGAAAGTGC 
15001 TCTGATTACA GGTGTCAGGC AGOGTGCGGG GOCTACAGTT GIU I 1 1 1 1 1 
15051 ACPCACPOCC ACAGATCAAT CATTATAAGG AGGTTAGCTT TOCTTAAAGA 
15101 ATAGVCTCTC CTTAAGOGGT TTTATCAACA AAGCCAGGGA ATGOCAAACT 
15151 TTAACACTTT TAGCTAAATT TATAACTGAT GCATCTATOC ATATATACAT 
15201 ACATAGATAC ATCTATATOT TGTTATATAT GTATOGGTAT CATCAGTATA 
15251 GKTATCAGT ATAGTAATTG TTTATCTGAA ACTTOGGGTT CTCTOCGCT 

15301 ci i luuie crcTcr c r c r cicictoct cTcr ciucr c tctatatat 

15351 ATATATATAT ATATATATAA ATATATCTAT ATATATTTTT TCTTTTTTTG 
15401 AGACAGGATC TCATTOGTC ACCCAGGCTG GAGTGGAGTC GTGGGATCAT 
15451 GGCTCACTGC AGCCPOGAGC TCCTGGGCTC AAGTCATCCT GCCAGCTCAA 
15501 GCTCOCAAGT AGTTAGGACT ACAGGGGCAT GCCGCTACAC GTAACTAATT 
15551 1 1 IGIA TTTT TTTCTAGAGA CAGGG1 1 1 PG (XTTCTTOGC CAGGCIGGIC 
15601 CTGAAATGCT TCGCTCAAGC AATCTGTOCA CCTCAGGCTC TCAAAGTGCT 
15651 GGCATTACAG GTOGAGGCA CTGGGGGGAG GCTAGATTTT TTTCAATTCT 
15701 AAAAAAGTAA CCTGCTCOCT ACTGAAGTAA ATAGAGTTAA AAAAAGTAAT 
15751 CPGGTACAGA CACCTGTATT TTCPGACAOC GCTAGAAGAG TCGCAGGTAC 
15801 CCTATAATCA AATAGATTAA CATTTOGCA GCAAAATCTA 7GGATAAGTG 
15851 AGTTAAATAG AGAOCATCAG TAGCTTCAGG TCAGTTCAGA TCAAG 1 1 lib 
15901 CTTOAATTA AATCTTGATA TTCTCl IACA AAAACTTTOG Gl I IUGGTTT 
15951 TCAGATTTGC AAATAAATAA TTATAAATTA TTAI 1 1 1 1 1 1 TGAGACAGAG 
16001 Id I GLICjI O TPGCTCAGGC TGGAGTOOCA TOGCAOGATC AGGGCTCACT 
16051 GCAAGCTCAA CCTCAGGCTG AAGOCATQCT COCACTTCAG GCrCXTAAGT 
16101 AGCTOGGACT ACAGGAGTGT GOCAACATCT GCAGCTCATT 1 1 IGIATTCT 
16151 TAGTAGAGAC AGGGTTTCGC CGTCTTOGOC AGGCTGGTCT CAAACTOCTG 
16201 GTCTCAAGTG AT00GCCTOC CTTGGGCTTC CAAAGTGTTC AGATTATAGG 
16251 TCPCAGOCAC TOGGGCTGGC AGAATTATAC ATTTATATCT CAATATTTGC 
16301 ITTTCTTTTC TGTTTTTCAG TAAAGG1 1 I I TTAAGGTACA I II I ICIGTA 
16351 TCTCATAAGG CAGCTGC1 IA ATTCTTTCAG TAAGTCTCAT Gl ILIACCAT 
16401 ATTCGTCTAC CCTACGTTAC TCAACCAGGC CTCCTTTGTT TAGTGAGTAG 
16451 GAGGCAGTCT TCTACAACAT ATOTAGCATA TCTCTATATC TGGTOGAACA 
16501 AATTC7TT7T TTOOCCTCTC TTCGATTOCT TCCTTCGGTG TAGGTCCAGA 
16551 AGTCAGATTA CTSG1 PGAAA GGGTATGAAC AACTTTATAA CAGLIGI I AC 
16601 ACATTGGCAG ATTAI IU I I AGAAAACTTC AATCAGTTTA TTGTGGCACC 
16651 AGTGATCTGC TOOL I IKJLIG AAAAGGCTAC CAATCTTTOG TTTTAI 1 1 1 I 
16701 ATTAGTATTT GCTAATTTGA TAAGTACTAA TGATAI 1 1 1 I TAAAAG TAGT 
16751 TTAAAATCAT ATTTCAGTGC TTATAAGTCT GTCTTGOCAG TTTTTGAGCC 
16801 CTTTAGAAGC TGCAAATCAC CTGGCAATTA TATATAATAT TTGAAAATAC 
16851 AAGAGGAGAT ATCGCAGTGA ATATATTAGA GTAAAACTTC ATTCCCATAG 
16901 GTAATGAAGG AATGCTTCAG ATTATCTTAG GCCTTAGATT CTCAGCTGAC 
16951 ACATCTTCGC AGGTAGACCA TCTCCTTCTT TOCTC1GCTG TCTTAGCGCA 
17001 GGIGI I CA TC AAGGICICTC TTAGGGGGGG GGATAGGAAT GGAAATAAAC
17051 CATCTAGAGA CTTGGGCATC AQGA CTTTC1 GATTCTTGCA GGTGACATCT 
17101 CATCCTTCAG AGGATCAAGT CTGCAAGACT AGGCATATCT TAATCICI I I 
17151 CAGTOCTATC ACCTTCCATC AAGCTCTQGA CTCGA GCTAA TTOCG TTGAA 
17201 AATATTTTAT TAATTAATTT TOQQOTATCT TAAAAATTTT CTTTCCATGT 
17251 ATTTATTTAG TTATTTTTAT GAGACAG9GT CTX3GCTCTCT CACOCATCCT 
17301 GGAGTACATA GGCTTOCAOG CTCATOQCTC ACFGCAGCCT 7GACTTCGCA 
17351 GGCTCAAGTG ATCCTGGCAC CTCAGCCTCC TOVCTACCTS GGACTACAAG 
17401 TCCATCTCAC CACATTTGQC TAATTTTCAT ATTTTTTCTA GAGAOQQGGT 
17451 TTCGOCACAT TCOOCAGGCT QGICICAAAC TCCTOGACTC AAGGGATCTG 
17501 CCTGGCTCAG GCTOGOGAAG TCCTCAGATT ATAGCTGTGA ACDVCGGGGC 
17551 OQQQCrCCOC ATTTAATTTT Gl IGI 101 IG 1 1 1 1 I IAGAT QGAGTTTCAC 
17601 TCTTCTOGGC GAAGCTGGAA TCCAATGGCA GGATCTOGGC TTAOQGOCAC 
17651 TTCCAOCTOC TT3GCTTCAAG OGATTCRXT GTCT CAGCXT CGOGAGTAGC 
17701 TQQGATTACA GACACAOQOC ACTATAOCTC GCTAAI Mil OTA I I 1 1 IAG 
17751 TAGAAATGGG GTTTCAGCAT GTTGGTCAGG TGGGTCTCAA ACTATTGAGC 
17801 TCAGGTCATC CAOCTGOGPC GGCCTCGGAA AGTGCTGGGA TTATAGATCT 
17851 GAGCCAOCAT GTCCAGGCAC OCATTTAATT TTTTGAGCAC AAAATATCTA 
17901 CTCAGAGCCA OGCAAGAAAC AAATTCGACT TATTOCATOC TCTTGAGAGG 
17951 TCATCAGGGG AAACAAAATC ATACATAAGT AACT OGAG A GAATATGCTG 
18001 CACATGCTAA ATCCTGTGCA AGTAAGATAT AQGATTTTAG AGGAAGGGAG 
18051 AA7GAOTCT GATTCAGCTG ACTAGAGAAG GCTTCAGnT TTGAGTTAGG 
18101 TCTTAOGAGA 7TCGGAGACT TTTCTCAGCA TATCTAACAG AAGAGGGTAT 
18151 GOGAGGTCAG AGTGTAAGGC CTCGGCAAGG GTPGGGAGGC AGTP CTAATA 
18201 CTGAATCTTC 7GACTG7GGT TTACTATCTA TTTCAGGTTA I 1 1 I G i l IAA 
18251 TCTATOCAGT AAltCI I PCA TCTAACAATT ATGATGTGTG 101 1 I IAGCT 
18301 GGGGCTACTA AGGCTAOTAA GTAOTGAQGC TOGATTTAAA CTTAAGTCTC 
18351 CAG L I POSTO GGOCAGGTTC TTTATACTTG ACTOCACACT GGGCTTATTA 
18401 AGTGAA7GAC AAGGAGTTTG ATTTCTOGAG GGCGGAGGAT ATCTAGTGAA 
18451 AGGTCATAGA CTATAAGGCT GGCAAAGAAT GGTGGAGCTG GGGAAGGGAA 
18501 GATCTAAATC COAAGOTAAG CXTGGAGATT ATTTAATAGA GGCAATGGGG 
18551 AGATACTGAG GATTTCTGAA GAGGGTT7CT AAGTAGTGCT TTAAAAAGAT 
18601 CTtTGGGl nG CAGAGGCAAG AAATAGGAGG AACACCAATT AGAAAGQGGC 
18651 GAGOGAAAGC CGAGTGAGAT TTCAGGGTCA GCGATOGAAA TCGGAATGAA 
18701 GGGACAGATG TCAGACATTT CAAAGGATAG ACTGCAGGAA TTCTOGCAAA 
18751 GAATGGOTTC GACAGTGATG AAGAACAGAG AAACAGCATT GATCGCTAGA 
18801 TCTCGAGOCT GIGIC I IA AT AATTTT7GAT CAGGGAGAOC TACAATACAT 
18851 1 1 IGIATTCT GCXX3CACAAA ATCATCTGGA CAGOOCTAGA TCATCTCATG 
18901 GACAGOTATG GGGCATGAGG AGAAAGAGTT GOGAGGGAAA GGCTOGCTCA 
18951 CATTTTAGAC ATCTGGAGOG TTAAGGGTCA GCAAGACATC AAGGGGGATG 
19001 TCAGGCAGGT AACTGGAAAT GCAAACTGGT AGTCTCAGAG TCAOGCTGAG 
19051 TCCAGTGGTG GGGGTTGGA AAGAAGATCT ATCTCTAAGA GGAACATCAC 
19101 ATAATGGAAA GCDVGAAATC CIGICI I GAG OQGACTCAGG GGUGU IU 
19151 AGCTAATTTA GTTCAI I IU ACTGAAATCA TTATATTTAA AAGTATGGCC 
19201 GCACTCAGAT GGOCACTGTA GCTGCTGOCA GLICI IAGCT I IGGICI IAA 
19251 AAAAA AAAAA AAGTAATAGA ACTTOCTTAA AAIGICI I IC TAGOLI I IGG 
19301 A l I I C I CIA A TTCAGATTTC OGTC1 POOCA AGGGTCAAAA TTATAI I 1 1 I 
19351 ACTATOOCTG TCTTAGCTAT TTCCAAAACT TCGTO IAAG ACTTAGTCAT 

19401 n mrocn caatttgaca tgactgctaa agaci r iigg catcttcctc 

19451 CPCLI 1 1 CAT I IGIGATCTA ATTAAOTTGG TCTGTAAGTC TTATTTTTAA 
19501 GAIGI ICIAG ACGAAGAGAC TGTGAGAATA GCTTACAGTC ATTTCAACTA 
19551 ATTTATCTAT TTTAAATTTA AAGTATTGAC AGTGGTGAAA AGLIG1 ICAA 
19601 CAAGCAGATC ATGTTATCTT ATATATTCAC AGAGTTTAGT AACTGAGOCA 
19651 ACTACTTCAT TCAOVGTTCA AAATGAAAAC AGCTAATTCT TTTAAGTAAG 
19701 TATAGATTCT AC I CI 1 1 AAA AGAGTTATCT AGGAGAGCTA CTATAATACT 
19751 ATTATAGAAT AAGTGAAATT AAGTTATCTG ACTGGGACTG GGGTOGAATA 
19801 TTOCAGAOTT TATTGAGAAA TATCGACTCT TTTTCCAGAT TCATAATGGA 
19851 CATTTGAGGT I IG I IA GGGG AGGAGGTCTC AICI I IATOG CACI I ICIGG 
19901 CTGGGAAGGG AGTCAGTOCT AATTGAGATA ATAACTAGOC AGCTGGOCAC 
19951 ACACAAGTCT GTTTTOQCXl ACTTACOOT CACACACTCA GCAGTGAGAC 
20001 TCAAGAGACT GTCAAAGTAC I 1 1 ICAATGC ATAAAGCACT ACAGATCTCT 
20051 CCAOACTOTT GTGAGTGAGC AGGTTGGCAC GG7GGCTGTG TGOGGGGGTG 
20101 IGIGI IGACT CAOGTGCPOT CCTGTGATCT CTCAGGACTC AGGTOCTGAA 
20151 I IGCTCVGI I GTGACTGAAG GCCAGCTGAA GOTGCTOGAA GTGCAGTGAC 
20201 GCTGGAGGAA GAAGCAGFAA CAGCAGAGGG TGGATGAAAG GGAAT TCATA 
20251 Gl IGIGIA AG AATAGATATC TOO CG I I I I I TCTAAGOCAA GAGAOCTTTA 
20301 COLI ICCAGT AAIIGIMCA ICI 1 1 IAATA TCATTTCGCT TCATTTAGAG 
20351 A A I C I GIA TT AAAGGAAAAG TGAGCATGTA AGGTGOTATT TTCAGCACAT 
20401 TTCTCTCCTC I IGIGOCTTC TGGGTGAACT GAAGTACTGG CTTACTCACT
20451 ACTAAATATC TTTTCTGAGA ATTATAGGGA AOGGAAGAAA AGGAAAGTCC 
20501 AATTAAAGCA MM I CI O CT CAGAGTTTGA AAAATAGAAT TCATCCAATC 
20551 TTTTAAATTC CATOOCAACA CATCAGACAA GAAGAGACTT GATAGTACTA 
20601 AAQCTTOOGA ATCAAAAGAA CAATGTAACT TTTTGATATT GACTTCAAAA 
20651 CATGGTGTTC TATAATTTAG Id I ICATTTG TTACGTCTAT QOTATTATAA 
20701 TTAATTTTGT ATATGTGGTA GTTATTTTTT GTACTTPGAT TAGGAAACAA 
20751 ATATTGAGGC ACTTCAAGAG GCAGACTATT TTGGAAAAAA AAGTCPGGTA 
20801 AAAGTAATGC TTAATOAAT TAATC1GICT TCATOCIC! I AATTCATCAA 
20851 GATCACTTTG GGTGTCTGGT GGCAACTGAG AATOQGTTAT GGAAGAACGT 
20901 CAAGGGAATC TAATC3CCTAT TATTTAOGGT TACTTCAGAG GATAAATTAA 
20951 TCAGCGGTCA CTAATCTTTC GATAATCACT CTATTGAGCT GGAACTATCC 
21001 TTAGTATTTT GGAAAGCAAG TCAGTGAGTT AGAACTGTCA AAACTGATCA 
21051 GCTTFTCTAA GOTAATCAT AAGTCAATAG AAACTAGTTC OCTTCAACCC 
21101 TTTCCTCOCT GCATTGCAGC ATCATCATTC TCTAACTCTG GAAATCCTTT 
21151 ATCGAACAAC AGTGAAAATA CATTGATACA CTGTC1 PGTG GTAGATTTTC 
21201 AGATAQGCTT TAGACAAAGT TCAGAGCCTT TOCTCTAGCT GGGGATTAAC 
21251 AAAGCTG0CT TCATAGTTAA ATCTTTCCAC GCTGTGTATC CATTTTCAGT 
21301 TACTAGAATT AGGTAAGTTA GTGTTATAAA ATTOGTTTX5A GTGTGGATTC 
21351 TTTAGGAAGT GAGTCI I I PG GTGGCAGCAA TTCTGTTATC CATTAAATAG 
21401 ATACATATTT TCAACTAGTC GACAMGII I CAGTCTGTAT TTATTAGATC 
21451 CTGGGGTGGG TATGGGAATA AAGAAAOCTA TGAGGGOTCT TCGAAA AGTT 
21501 OVPGGAAAAA ATGTACACTA TCAAAAAAAA CTGTGCATOG ATTTCAAAAT 
21551 ATTTTTCAAC CAAAATCAAC TTGTACTAAC TTGTTACAAC ATCTCTGAAC 
21601 TTCTTACAAC ATCTCTGAAC AGTP0GAGAC ATTAAGAAAT GATATCGCAC 
21651 CA G1 MMA A AAAGCGOCTA TCAGGGTAAC ATGAATTCTG CTAAAATTCA 
21701 AGCAAGAACA AACATCAAAT TTATCGTOGT GCTTGQGTAG AAGGATCGTG 
21751 AAATOVTTGA TGCTTTACAA AAAGTTTATG GGGACAATAC TCTAAAGGAA 
21801 CCAGCAGmr ACAAATOGCT AACATCCTTT AAGAAGGGAC GAGATCATCT 
21851 TCAAGAGGAA G00CAOV5CA GTAGACCATC CGIGICAATT TTCAAGGAAA 
21901 AAATTAATCT IGI ICATGCT GTAATTGAAG AGGGAACTTT ACATCAAATT 
21951 TTAAACAAGT GGGATTGAGA TCCTCTGGCA TATCTCTGAA GGAT7GTAAT 
22001 AGGAGAGGAA ACATCGCTTT AOCAGTATCA TGCTGAAGAC AAAGCACAAC 
22051 CAAAGCAATC GCTAOCAAGA GGTGGAAGTG ATCTAGTTAA AGCAAAAGCA 
22101 GACTAGTCAA GAGCAAAGGT CATCATAAGA GACTTTTGGG AT GCTCA AGG 

22151 t ai rrrecn gttgactttc tggggaggga aagaatcata atatttgctt 

22201 ATTCTGTGTG TTTTCAGAAA ATTAGCCAAA Gl I I IAGCAA AACAAAGAAA 
22251 TAOOCAGGGA AGCTTTAOCA GAGAGTOCTT CTCCAOCAGG ACAATCTTCC 
22301 GGGTOVTCCT CTCATCAAAA AAGGGCAATT TTCCAAGAGT TTTGATOGGA 
22351 AATTATTAGG CATTCACTTA CAGICM M I M M M M IA GTPGGAGTCT 
22401 TACTTTCTCT CGCAGGCTGG AGTGCAGTGG TGCAATCTTC GCTCACTGCA 
22451 ACCTTXACCr GGCAGGTTCA AQCAATTCTC CTGOCTCAGC CTQO CAAGTA 
22501 GCTGGGATTA CAGGGGTGAT GCAGCATCTC CAGCTATTTT TTCTA! MM 
22551 AGTAGAGATC GGGTTTCAGC ATC1 PGAOCA GGCTGCTCTT GAATTCCTGA 
22601 CCTCAGGTGA TTCACCTGTC TCGGCCTTOC AAAGTGCTGG GATTAGAGGC 
22651 ATGAGGCACT G TO0C F SG 0C TATCTTACAG TCTTCATTTC GCTTT ATCTG 
22701 A G TCI MM OTTOCTAAT CTTAAAAAAT ATTTAAAGGG CAGCTATTTT 
22751 PCTTCAGTTA ATAATCTAAA AAGGACTGGA TTGACATGAT TAAATTCCTG 
22801 GGAOCCTCAA MCI MAGAG ATGGACTAAT GGCTGGTATC AACTCACAAA 
22851 AGTAICI IGA ACTTCATOGA GCTTATCTTC AGAAATCAAG TGTATATTTT 
22901 CATTA1C1 I I TAATTTCATT CTTTAGTGAA TTTTTTGAGG TO0CCTTCTA 
22951 TACATTTTAA TCCTAAGGGA ATAAAGAAAG GAGGAAGTOC TAGCCCTCTG 
23001 CTGlCnG C CT AGCTACAGTG TCTGAAACAC AGACCAGTAT TCAOCJLI I PG 
23051 AAATTPGAGG 1 1 IULATTCA GGAGG1 PCTC AAAGAGAATA AATCAGATPG 
23101 CTATGGAGGT GGAATCAAAG AGCACAGGGC TTATTTATCA TAATCAAAAT 
23151 AATGGGATTT TGATAAGAAA CTPGACCTOC TTATCTACAT TCTAAATTCT 
23201 7GCCTTCATA AGCTTC00QG AGATAAAGTA ATTCAGCTAA GTATTATTTC 
23251 CAATCATAAT TTTCTGTCAT TATGAGCAAC ACAATACTAT ATATCGGATT 
23301 GATTCACTGC AGAACTGGAA TAAATATAAA TTAGAICM I AGAAAAGAAA 
23351 CGTAGATTTA AAAATCTTAT GTTAGAAGGC TCAATTAATT AAATGTAATT 
23401 A A1TMMA A AATCAGCTTT ATTGAGGGAT GACTTAGATA TTATATAATT 
23451 CACAAATTTT AAGTGTACAG TTTGATAGTT CTGACAATGA AACTGTATAC 
23501 AATCATCTAA CCAOCATCAC AATCATAATA TAGIGIGICC ATCAOOCCAG 
23551 GGTUTAOOCT OGTGATCCTT TTTGCAGTTA GIU I 1 1 IOC CTTACATTCT 
23601 GGGP O CTGAA AACTTGATCT GC1 1 ICIGIC ACTATAGCTG TGCCI 1 1 Id 
23651 AAAATTTTAT ATGAATGGAA TCATACAGTG TCTTTTCm TCTAICIGI I 
23701 TTTCACTCAG CATGATOCTT T7GAGATTTC TC U I GMG I GGTATCTATT 
23751 AGTA GI ICI I I CM M MA T TACTAAGTAG TATTOCATTC TATGCCTATG 
23801 OCACATTTTT 1 1 III Mill TTOGAGACAG AG 1 1 MGCIC TGACA7CCAG
23851 GCTGGAGTGC AGIGG1GTGA TGATGGCTCA CTCCAGCCTT GACTTOjCAG 
23901 ACTGAGGTGA TCCAOCTCCC TCAGCCAOCT GAGTAACTGG GAOCACAGGT 
23951 GTCTGCTAGT CIGICTA ATT TTTAAATTCT TTCTAGAGAT GGGGGTCTCT 
24001 GTATATTCCC CAGGCTGGTC TTAAACTOCT GGCCTCAAGC AATOLI ILIG 
24051 CCTTCGGGOC TCAAAGTGTT GGGGTTACAA GTCTGAGGCA TCACACCPQG 
24101 CCTACCACAA I I I 1 1 I A TOG ATTCAGATAT TCATCGATAT TGQG1 IGI I I 
24151 TCAGTTCTTC CCTATTATGA ATAGAACTGC TATCAAGATT TGTATGCAAA 
24201 CCI I IG I PS G GATCTATC1T TTTAI I ILIC I 1 1 IGIACAT TAAATTTAAA 
24251 TTTAAATTTT Gl I LI G I ATT Al I IGIATTT TTAAATTTCT CAAGTGGGTA 
24301 ATACTCTCT A CM 1 1 M I! I GAAATTAAAA AAATTCTGGC TGAACAAGGA 
24351 GATAAAAAAG TAGGAGTGAG AGGACPCTCG AGAGTTACAG GGCTTTOGTT 
24401 TAGAGGATTC GATCAATAGT GGIGCIGOCA ATAAAGAAAT TTAAATATQG 
24451 CTGATATTTC CTATATTTAA GAAAGAGCAA AGAGGGTGCA TTGAAATTCAG 
24501 TCAGTGGGAA ATCTCPGATC ACTTCAGOCA GCAOGCTTTC AT0QG0CTQG 
24551 ATATATOQGA AGTGAAGTCT GATTATAGTC TGTGGAGGAG TCAATCGGAG 
24601 GAAGAGATAG GGGTACAGGC TAAGAAGGGA GGAAGTCAAG TCAAAGGGAG 
24651 AAGTAGGGTG GTAGCTAGAG GAAGATTAGA GTCAAGCGAG GCTAACAATT 
24701 M I M I MM TTCAAGATAG GAGTAGCTTC AGAACTAACT TAAAGAAGGA 
24751 GCCTCTAGAG AGGGAGGAGG TGAAGTTACT AAAGGTCTAA TTCATGGGGT 
24801 AAGGTTCATC AGCAGATCAG ATCTTTACAA GGAAGGTTCT TGCTCGGGGG 
24851 CAAGATTCAA AAG00CTATT CAGMCCACG AGAGAAGAAA GTAAGAATCG 
24901 GAGCAAATCT AGGTAGGTTT GGTGAGGATC AGGAAATGGA GGGGAAGAGG 
24951 TCATTAAATG TCGTCCTCGG GTTGAGGAGC AGATTOGAAG AGAATOGCAA 
25001 AAGTTTGGGA GCTGATTAGT GATAAGGAAA AGCTTTGAGA ATCC3GATGAA 
25051 GATTAGAAAC CATGCATTTC TAGTGAAACT IGI I ILTAAG AT7UTG0CCT 
25101 TACGCACCTC CAGCTCTGCT CPGATAGGTG AACTATACAA TTGATGTAAG 
25151 GCTOGCAGAT AATCAAAGCC AAGAAATTTC A I G I I I IC TA TCTATTTTCA 
25201 CTTCCGTOCC ATATAGCTTC TCTCATATAG TACPCATATG TTTPGACTTT 
25251 TTOGTGAAGC TCATTAAAGA TTTAACAACT I 1 1 1 1 II MA CAM I II IAT 
25301 TCTCAGAAAA CATATATAGC ATAAAATTAC CATTCTCGTA AATTTAGCAA 
25351 AAOCCATTTA TAGGATAAAA TTTAGCATTT TGGTCAACTT ACTCTAAATT 
25401 ACTGOCAGTT CTCTACAG0C AM IGI MAA 7GCAGTCTGC TCTAATAO0C 
25451 TGCAAGGCAA AAT7GCAAAT AAAAAOCTOT TTTCATTTCT ACTTAATTTT 
25501 GOTAAATTTA CCAAAAGGQC AAATTTTATG CTATAAATTT I I M IGM I 1 
25551 GACATTTAAA ATAIGI 1 1 IG TAAAA AA A AT TGAGAGATAA AATTGTATAT 
25601 TTAACATCCA TAATGIGATC I I I IGIATTA TGTATAACAT GATCTATATG 
25651 TOVTACATAA CATATATACA TTCTAGAATT GTTAAGTCTA GOTAATTAAC 
25701 AAATGCATTA OCTCAQVCAG TTATCATTTT TCTGGIGAGA ACATTTTAAC 
25751 ATCGACTUTC TTTAAATTTT TCAAGAATAA AATTTTATCA TCTGGTCATG 
25801 GTGGCnCAOG GCTCTAATCA CAACACTTTG GGAGGGGAAG G0GGGAGGAT 
25851 TGATTGAGAC TAGAAGI ICI AGAGCAGTCT AGACAACATA CTCATACCGC 
25901 GTCTCTACAA AAAATAATAA AAAAGAATOC AATATTCTCT AAA I I ICI IG 
25951 AATTTATTTT TQCTAOCTAA I IGICGI IAT ATATCCTTTA AGCAAOGTCT 
26001 GGGCATTTOC CTCTOCTCTC TAAGCAGTTX AGCCTCTCAG ACCAT7CTAC 
26051 1 1 ICIATTTC TATGAGATCA GCCI 1 1 IAAG ATTCTACATC TCAGTGAGGT 
26101 GCTGTAAGTA I I IGICI I IC TGTACjCTCGC I IGI 1 1 IACC ATTCTCAGCA 
26151 T7TATAAAGT GTACAATTTA GTGGCATTAA GTACATACAG AATO LI G I IC 
26201 AOCCATTACT ACTGTXXTAC AATTTTAATT TGAGGGATCA LI I IGI CATC 
26251 ILILI ILIAT GLI IIGIACC Al ILMGATC TATAAACTCA ACAAAGTCCT 
26301 TAACTGAGTA GAATAI I II I GTATGTCTCT ATAGATCTCT GTGTGTCTAT 
26351 ATAGOTATCT ATAGATATAG ATATAGATAT AGATATAAAA 1 1 1 1 1 f 1 1 IG 
26401 AGAOVGAGTC I IG L IL I G I I GGGOSGGCTG GACTGAGTGG OVTOATCATG 
26451 GCTCACTATA GCCTCTAGCA TACAGGCTCA AGCAATCCTC TCAGCTCAGC 
26501 CTCOCAAGTA GCTAGGACTA CATCCATGCA GCAGGATGOC TCGCTAATTA 
26551 AAAAAAAATT M II M I ICI AGAGATGGGG ICICIGIGII GOCTAGGCTG 
26601 GCCTCAAAGC TCTAGGCTCA AGCAGTOCTC CTGOCmGGC CTOGCAAGGT 
26651 GCTGGGATTA TAGGAA7GAG CCACCGTGCC TGGGCAGTAT TTCTAI MM 
26701 GATACTGAGC TTAATTTTCG GAAGTCCTCT GCAAGGCACA AATTCTGGTC 
26751 TTAGLI I ILI AACTATTTOG TTGTAATTAT I IGI IAATAT LIGILI I HOC 
26801 TAGTCATOGA AAGTCCA7GA GTGCAGGGGC TC I GIC I G I C I IGI I IACAA 
26851 CTATATTTOC GCTGGCCAGC ACAGTGLI 1 1 ACACATAGTA GACATTCAGT 
26901 G A I GI I ILIG AACAAATGAA AGILLI ICJbC TGGAAGAAGA ATTTATTTCA 
26951 GTTTAATAAG I ILI I ILIGA ATGCTTCATA TGAAGTAGGC AAAATTTCTA 
27001 TTCACATATT AATAGAGATT ATAGGTA TOG CACATTCCAA TGTCTATTTA 
27051 ATTAGTAGAA ATCAAAAAGA Gil I I IATTT GGAATTTTOC CTTGGAGGGT 
27101 ATTAGAGCAG ALIGIGCACT AAACAAATAG CTOTAGAGAA TTOGCAGCAA 
27151 TAATOOCAAA GT I IGAAGGC TCTAGGTTAA TTAGGATAAC TTCACAGGAA 
27201 GTGAGTTAAT CAAATTTCAG AGTTTAATCT TCCAATAATT TATGTTCAGA
27251 CATACTTCAA GTATATOVQC AGCTAACAGG AACTTTAGTT GCAGAATCGC 
27301 CXAAAACACA AGAACTGOVG TOGA I 1 1 1 CI" GGCTTGCAGG AATGTTTTOG 
27351 AGGAAGAAAA ACCAATAAAA TGATTTOGGG GTCAI 1 1 IU TCCATTACTC 
27401 TATATTAAAT ATACTAGAAT TTAAAATATT AAATTTTAAA AGATAAAAAG 
27451 ATGCAGTTTA CXTATTAACA AATTAAATAA TTTAGGAATT CTACTTAGTT 
27501 CTCTAATACT TTAATATCAG TAAATATOQG CATTTCTCTG TTAGCTAGAA 
27551 TTAGATAGAG TATTGGCATT 1 1 1 I ICAACT GGCTTATCGT TAAATGGAAG 
27601 TAAAGGGGCA AACTACACAT ATAAGAATTA GTAOTACAAT ATTTAATACA 
27651 COOCTGTAAG AGTTATCACA GTGTCTCCTC TCTGAAAAGT AAGGGCTOCA 
27701 rCTSraCl PS TGAAAAAGGC CACTGGAGGC OLI 1 1 ICAAA AATTAAATCT 
27751 GGCTCCAGCA AQGTCTTTTT CGATCACATG GAAAGGGGAA GAAGAAGCTA 
27801 TCAGGAGCTC TQQQOTTTTT TTTCTTCTTG TTTCTTTGTT TTOOCACTTT 
27851 TAACTOCAA GCTAAAACTC GGbl I ICATT TGAGGAACCA GTAATAGAAA 
27901 ATTTCTTATG TACATTCAGC AAAATCTACT ACFGAGTT3OT TACTTTCGCT 
27951 TTTCA1TCTG GGG AT TG IG I" GTCTUTGAGT ACATGCAOGC AC I IGIGIGI 
28001 TTAAGOOTGT AAGGCAGACA GACAGTG3GT ACAGblCI 1 1 GAAATOGACT 
28051 I C I PQGCAAA AGTAATAGAG AAAAAGAGGA ATACAAATAA GGGAGGAGGG 
28101 ACAGGGAAGA GCAG^GTCAC AGGAAACAGT GAATGAGGCT GCAGTCTCAG 
28151 TCBGGCTTTC 1 1 I G I CO CTC CAGIGI lb I I GOCTGTCTTA TGATGATGCT 
28201 GGTTTTCAGC CAAOGTTGAG TGACTAAAAG COGGGTCTGA GGTCTCAGTC 
28251 GCTGOGTGGC TGATATGAGC AGCTTGCATT TOGACTGGG GOCTGGAGCA 
28301 GCAACAGCAC AGATTTOCAG GAAGAGTTOC TCTTCTCATT TTTAITCUG 
28351 AGTCATCAAA TTTAGTTATT CAGACGTCPG CTCTTOGCAG C TACATACAG 
28401 ATCAAACAAG CAGGGAATTT 1 1 1 ICI I ILI CI 1 C1CIO OC ICI I II 1 1 II 
28451 GTATTTCCAT CTTCTTTTC1 ATACCI 1 1 IC 1 1 IGI I IAAG TOWGCATTT 
28501 GAACATCACT AGFTACCATT TOCTTTAGCA AGCATAGGAC I ICIGICI IA 
28551 CTTAAATCTC TTCTAATGCT GTGAIGIGIC ACAGTTAGTT GAGACGTTAA 
28601 AGATGTTOGA TACATCTGAC TAGATTGGTA AATCTCAAAA ACATCATATC 
28651 GAATGAAAAG ACAAAGTTCT GAAATCTCTG ATCTGATOGC Al I ICCI IAA 
28701 AAAGTCATGT AAAGCAATCC TAGCACATCT CATAGAATCT AAAAGGCTAT 
28751 ACTGATOCTA AGATOCAOA TTAI 1 1 ICIG TACACTGAGA AAGGGGGAAA 
28801 ATTCCCACTT AAA I IGIGAT GTAATCTCTC ATTATOGATT GTAAGATACA 
28851 TCTACATTTT AGAAATOGTA TAATCTTAAA AATATCCATT TTA AATTGA A 
28901 GGTAAATTTA ATTAAATTAT TTCAAAGGAA ATAAGGTAAA TCTATTTTAT 
28951 7GAAGCATAG TTATCTAATA AAAATAGAAA AGCATGCATA GGAATGCTAT 
29001 TTAGOCAATA CAGGATCTGG TAATCTOAT AAAGGGAGGG AGGGAAATOG 
29051 AGGGGGOVGG GCAGAGGAGG GGOCTCATCT CTCTAGTTTA 1 1 1 1 1 IAAAA 
29101 TTATAAAGCA AATATTAOCA GAOTTAAGAT TTTACAAAAT TCATTGGTAA 
29151 GCAGCTATAA III ICIG AGT GCI I ICAOTA TTTC ATAATC AAAAGTATCT 
29201 ATTTTAAAGG TAOGTTATCT ATTTAI 1 1 I I ATTTAI 1 1 1 1 TTTGAGACAG 
29251 TCTTCTCATTC CATOGCOCAG GCTGGAGPQC AG I GG I GIGA ICI IGGCTCA 
29301 CTGCAACCTC CTTAAAGCTA CATTATTTAA AAGPCAGATA CAAAGCAAGT 
29351 TOOAGAAGGC TCTATCTAGT GGATTCTATT 1 1 1 1 1 IAAAT AGTATTTAAT 
29401 TCI ATC1 I CI TCTACACTTT TTOCTATCTC CATCTTAOCA TAGCTCTGOC 
29451 I M I I TOGTC GAAGTCAGGA CAGAI I GCI I TCCACATCTC CATTTTTGTG 
29501 TCTGAATTAA AAGATGGACA AGTATCATCT ATTATGTAG TAGTCATCAA 
29551 ACAAGGAAAA A QGTTTCTTI Gl 1 1 GCI IGI TTTTTTTAGA TGAAGTCTCC 
29601 GGCCAGGCTG GAGCGCAGTC GCACGGICI I GGCTCACTGC AACCTCTGOC 
29651 TC CIGIGI IC AAGCAGTTCT CTOCCTCAGC CIGCTGOOTA GCTGGGATTA 
29701 TAGGCGOCTC CCATCAOGGC GGCTAATTTT TCTATTTTGA GTAGAGACAA 
29751 GGTTTTOOCA AGTTOGOCAG GCTCGTCTTC AACICCIGAC CTCAGGTGAT 
29801 OCACCTGCCT TOGOCTOC3CA AATTCTTOGG ATTACAGGGG TCAGCXACTG 
29851 TOOOOGGGCT GGAAAAAGTT TTTAATOGTA AAGATCTOVT GGAATGAATA 
29901 GGATTOGCTG GCATTATTPC I IGCIGI IAA TAAGCAGTCA GAAAIGI I IC 
29951 GATTATATCT MCI 1 1 IAAG OCAG C I I I CI GU IGUCCC TTAI ICI I IC 
30001 I I ICI CA GGC ATCTGGTATC TAGAAAGGOT CAGGAGTAOC TTGATAAAAA 
30051 TTATTCTACA AGTTGAGCAA AACTCAGTAG TATCATOCCT AGAGATOGA 
30101 TAAAGAGGCA CTTTTAAAAT AGAGOCTTGA AGCTAACAAA CI 1 1 I II I I I 
30151 TTAAOOCCTT TACTGAAAOC TAAAAAGAGG TCTGCAOTTT TTOOCTGCTC 
30201 TCAGCAGATC AAAGGGCTUT AGTAGIGI I I GCTTATTCCT CAGGCAGTGA 
30251 GGAATCAAAG GACTGAAGGG GTOTTTCTTI TATCATAOA CI IGIAGGGA 
30301 COCCTTACCr GCTATAGOCA TCA AAAAG GA ATA I I ICI IA TATCCCATAA 
30351 TATTCTTTTA GTATCAGACT TAGG I I I IAA TG1CCI I ACT GTTAGAGTAA 
30401 AATAATTTOG GCAAGAOCAA II I I I IAAAT GGCAAATATA GTCACATCAC 
30451 TTGATTCAGA ATCTCACTOC CTAGCATCTC GGGTAATGAC TCATAAGAAA 
30501 GAAAAAGCTA AATGCATGAA GAGGTTCACT ATACOATTAC TATAAAAAAG 
30551 TAAAAATTTG Tl 1 1 IGI IAT 1 1 1 1 I I I I It, TTTTTTGAGA TCTAGTCTCT 
30601 CTCTGTPGCC CAGGCTGGAG TCCACTQQOG CAATCTOGGC TCACTGAAAC
30651 CTCCGGCACC TQQGTTCAAG CAA1TCTCAT GGCPCAGGCT GGGGAGTAGC 
30701 7GGGATTACA GGCATGCAGC AGCATCGCTC GCTAAI 1 1 1 1 OTATTATTAG 
30751 TAGAGAGGGG GTTTCAOCAT GTTGGCCAGG CTOGTCTOGA ACTCCTGAGC 
30801 TCAGGTGATC CACOCAOCTC AGOCTCXXAA AGTCCTGGGA TTAGAGGCAT 
30851 GAGACACTGC ACCTGGGCAA AAAAAGTAAA AATTTGAACT CAACTTAAAT 
30901 GGCCGCAAAT ACAGGAATOG TTAAATCCTG ATCAOCTACT TTATCTAOA 
30951 7GATGCAAAT GCAAATCATA ATOGTTACTG TQCAGCAACA TQGAAGAACT 
31001 7GGTGAAAGC AQGATCTACC ACATGACTGT AACTCCAAAA TCTOCTTCCA 
31051 TCTGAGGAGA GATCAAGGGA ATATAGAAAA ATCAAGACAT AGTTCTGTTA 
31101 GGGTGGTAGA ATTTTCAGTG GATTTO00C OCCACTATTC GTAAAAATTT 
31151 TTCTACTTAA TTGGTTCTOG QCAGOCAGAT CTTTTAAAGG TAAATTTCAA 
31201 mOCTTTA AGAAAATCGC AGACAGAAGG ATCGGGGATA CTAGAAAACT 
31251 AAAAGTAGTG C0G0CTTTCA AGATAAAACT AAAACATTTT AAGGCTGGAA 
31301 TTCCTTTAGC AGTAGATCTA TTCATTATTT AAI 1 1 JO IOC TTTAGAAGAA 
31351 AGTTOGOCCA ACACAATTAC ATGGAAGTTC GGTTATTGAA GAGGATTGAT 
31401 AAAAGAAGTG GGAAGGTCAG GQCAGGIGTG GTOGGTCATC GCTXJTAATOC 
31451 CAGCGCTTTC GGAGGGOGAG GTGGGTGGAT AAOGAGGTCA GGAGATGGAG 
31501 AtXATOOGG CTAACAGCTT GAAACXXOGT CTOACTAGA AATACAAAAA 
31551 AAATTAGO0C AGCATOGTCT TOGGTGGCTC TAGTOOCAGC TACTRGGGAG 
31601 GCTCAGGCAG GAGAATGGCG TCAACCCGGG AGGOGGAGCT TGCAGTGAGC 
31651 CCAGATCAAG GCACTGCACT OCATOCPGGG GGACAGAGGG AGACTX30GTC 
31701 TCAAAAAAAA AAAAAAAAAA AAAAAAAGTG GGAGGGTCAA AGCCAATCTC 
31751 CACXJTTTATT C I 1 1 PGTA OC ACCAAATAAA GQGAAI I I 1 1 GGAGCAACTT 
31801 TACTTTAGTC AATATTTOOT TATTTGCTTT GAAGCTATTT Gl I RGGAATA 
31851 ATGCAATTAA CATCCPCAAC AGAGOCAAAG 7OGAG0C7G AAATOGCAGC 
31901 ATCTTAACTA ATTTTATCAT TTGAATAGAA ATTGAAGOCr TTGAGGGCAT 
31951 GAAGTACTAT I I I I M IUL TATACTTCTT GIIIIIGbll TTTCCTTCAC 
32001 CACCGATTTA TGACTAGCAA AAG00CACAG ATCCTCTAAT TAACTCACAG 
32051 CTGOCATCTT GGCACGGTAG ATAGAATTCA GGAGCTGGTG ATGAATGGGG 
32101 AGAGAATAGA AAAACTAATA GACTCTAAGA ATTTTAGAOC TCTGTGGGCT 
32151 TGCTGAACAA TTAATOCAGC aXTTCACTT TACAGGTAAG TAAACAAGTC 
32201 GCAGGGAGCC AGCAGTGAAC CTCTCTCTTA GG^CTTCAAA TCTAGTGCAC 
32251 ITTCTCTTAT ACTTCAAGGA GAGAACTGGA GGGAGAGGGA AGOCAAAATA 
32301 CTGACATT7T AGAGGCTGCT TTTTAAGAAA GGATAGGACA TTGGACTTCG 
32351 TCTOVTTATC ATCTTAAGTT ATAGAATTTT TAAAAACACT AGCACTOCAA 
32401 AACAAAAATA GGTAAAAGAT AGGAAGAGGC AGTTCACCAA AGAAGAAATA 
32451 CAGTAGGCAT GAAAGAATGC ACAACATAAT TTAAGAAATG TAAATAACAA 
32501 ATATCTATTC CTATCTCAAG AGAAAACATT TACTATGAGC TTTATTATTC 
32551 AAAGCATTOG GCTCATAGGG LI I U lb I I G ACAATCTACC CTAATTTTAG 
32601 GAAATGTAAT TATAAGATAA AAAAATGAAT GCAAATCATA AAAGACGAAG 
32651 TATAA000GC TCAATAAATT TTCTCAAAGT TATAGTAGTT TTAAAGCTCT 
32701 TTTTATTCAT TCCCTOCAGT TCTGTCTAGC AOCTCTAAAG ATCAATTATT 
32751 GGC TOGG TCT GGTOGCTCAC GCCTGTAATC CCAGAACTTT GGGAGGCX3GA 
32801 GGOGGGOGGA TCAOGAGGTG AAGAGATCAA GAGCATCCTG GGCAACATCG 
32851 TCAAAGOCTG TCTOACTAA AAAATOCAAA AATTAGCPGG GCGTGCTGGT 
32901 GCATOOCTCr AGTOOCAGCT ACTTGGGAGG CTGAGGCAGG AGAAI ICCl I 
32951 GAAACTGGGA GOTGGAGOTT GCAGTGAGCC AAGATOGCAC CACTGCAGOC 
33001 CAGCCTGGCA ACAGAGTGAG GCTOOGTCTC A AAAAAAAAA AAAAAAAAAA 
33051 AGGTGAATTA TCTOVGGTTT TATATOGCAT TGATOGAOCA CTATTCAAGG 
33101 AAGCTGTCAA GGACTGAAAG CAGTTCAGAA TAGGTATOGC CAGCTAATTT 
33151 TACTTGCCTT TGCTGGCACA CACAAGCAGA GAAGCTOCT1 CAICTCI I AC 
33201 TCAGGAAATA TTTTCATCCT TATTCAGAAG ATTTTOCAGG ATGCTCAGGA 
33251 AGACAAAGOC AAAGAAGAAG AACAGTCAAA GTAGGAACAG GAGCAAAAGT 
33301 CTGGACAGTC ACTOGAAGGT GTCATGCTGA AGAAGGGAGG GATOGGACTT 
33351 TGAAATCAGG CCAAGTOCAT TTCAGTAACT GAGTGGGTTA TCTCTTCTTC 
33401 GCAAGCAATG TCCCATAOCT TTCAAGGGCT AAGCTAGGGC AGGAGTTCTG 
33451 ATAAGAGCTT TGAACTTAAT ATOGCTGACA GAAAAATAGG TCTCCTCAGfc 
33501 TOCTATTTOG AGGTAAAOOG GOTAAAAGTG ATAAATAGTA TTGCTTTTTA 
33551 AAGTTCATTA CATATTTTAG AGGTGAGTCA GTCAGAGCAT ATAGAGATCC 
33601 TTCATTTTAA TOCTCTGTAG CAAGAATTOG CAAAAATATT CTCTCTAAAG 
33651 AGCCATTTAG TAAATATTTT AGAGTAGGCT GCATAGTCTC 1 1 1 IGCAGCT 
33701 ACTCAGCTTC ATAGAAGAAG TAAACAAATC AGTGTGACTG TATTTTAGTA 
33751 AAACTT7GTT AGAOGCTAAC ATTOGAATCT TACACAGTTT TTATAIGI I I 
33801 TCACATATTC TTTTGATTTT TTOCAAGGAT TTAAAAATCT AAAAAGCATT 
33851 TGTACCT1UT GAGGTCTACA AAAATGAGCA G7GGAOCATC TTT7GCCTGC 
33901 AGGGCACATT GIGORGATCT TTACTCTATA GCATCACTGC CAGTAGAAAT 
33951 ATAATGCTAG CXATATGTTA TCGACTGAAT Gl I I G I A AOC CXjCAAAATCC 
34001 AT A I ITTCAA GCTCAAAGCC CTGGTC1GGC AATATTTGGA GATGGGAOCT
34051 TGACAGATCT ACTTAAGG7T AAATGAGGTC ACAAGQGTOG GGTCCTCGTC 
34101 AGVTAGGATT AGTCTCCTTA CAAGAAGAAA CACCAGATAC ACPCATTCAG 
34151 AAAATGCAGA ACATTCTAAC CTTACIGbl I TAGAAAOCAT AATACTOQCA 
34201 GQCTTGCTOG TCTTCATTTT TATTATTTTC AATGTOTCT TTCAAGTATT 
34251 TTATCTATCC AGAO0CAATT TCTA7GATAA GGGAATTTCT TTCAAACTAT 
34301 TAGGTGAAGT OTTAATTAT GAAGTTOGCT TAAGCCATTA AGTAGTATTA 
34351 ATGGGGACAT GCATCTTAGA AATTAACTQG AAATGTAGAC TAGAAATATA 
34401 AAAAAGTGTC GTtATCACTA ATGTCTCTAT CCATTTOC3CA AAAAAGAGAT 
34451 TTCOGTATCT GCTCACATTG CAGAAOCACA CTGAGACTCA GGGA AACATC 
34501 CTGACGPCAT PCI P0GAACA TCCTTAGTTT GGATTTCATC AAACAI I 1 1 I 
34551 TTCCATTTCT CXTTTTTCTG IGI I IAGTAA TCCTTTCTTG GATTCTTCAA 
34601 ATCAGAGGCA CTTATTOGAG 1TCATAAATC GCAGTTCTTC ATCTCACTGT 
34651 TTAGAAATTT AACTTAGCTT ATOGACTAGT TCAGGCTTAG ATAACTCTOG 
34701 AAGTCTTTOC ACTTTCAAAT ATALI Mill ATAGTTCAGT TTCTAGAGAA 
34751 TATAACATTT TATAACTTAA ATGGAAGTAA AGTCAAATOC CAAAATCCCA 
34801 GTTGATTOG TTA A TGTCT G CATOCCTOGA TPGCAGAATT OGAAAAATAT 
34851 CAGACATAAT GACTTTTATT TCTATTTTAT ATTATTAGAT TTATCAGATA 
34901 TAAA A I IGI I CTTTTAAACT GCTAI I ICIA AACACTCTTT 01 1 IU 1 1 AG 
34951 TTAT U I GIG AACTGGTAAT AGTTTTCAGA CIGGTOGCTG TCCATGAAGC 
35001 ACTTTCAGCA TTQGIOCAGA TCACTTTCAC TTATAOCATC AGCCAGCAGA 
35051 TACTOTAGAT T A I I PG GCTG MM IAAAGA GAAGATCATA TCATAGIGI I 
35101 GGTTATTAAT ATTTCAGCAA GATATTAGGG ilGTCTOGAT GTCCCAATCA 
35151 TTCTOGAGTG CTTAGTGTGT Gl I IGI MM TAAGAATTGA CTTOGAAACA 
35201 TTTATCTTAA GGTCTTAGGG TT A1 1 1 1 I A T TCTTTAGTAC TAAOGGTATA 
35251 AACTGAATTC CACTCAAATA GAGCTTATTT TCATGTATAA MM IGAAGT 
35301 T AI MM I A T AACTCIGTGG GTAGATAGCA GGCAAGTAGG CACTTTATTT 
35351 GGCTAATAGA AAAAAATATA T A I M lib i t. AAATATTTCT G7TTTATTCA 
35401 TTCATTCAAA GTATATTTOG AAIGI 1 1 ATT TOCAGGAAAT TTTGGAATAT 
35451 ACAATACAAC CAG CTTC1 IA TAACTQCACT TTAAGTGAGC CATAGGTCAA 
35501 ATAATCAOCA GCAAAATCTA ATCACAOGTG TCCCTCTTAC TOOL IGI IGG 
35551 AGGAA7TCAG GCACTCTOOT AACGCTGTAG GCCTGGA7TA GTOCAGTTCA 
35601 TTGGCAGCAG CATTATQCAG ATTTTATTCT GGGGGGGAAC GGTGGCTCAC 
35651 AGGGOTAATC OOGGCACTTT GGGGGGCTGA GTTGGGCCTG MGLIIGAGC 
35701 GCAGGAGTTC AAGACCAGCC TGGGCAACAT AGGAAAGGCT GTCTCTACAA 
35751 AATATATAAA AATTAGCTGG GOGTGGTOGC GIGTGGCTGT AGTACCAGCT 
35801 ACTPOGGAGG CTGAGGCAGG AGGATCAOCT GAGOCCAGAA AGTTCAGGCT 
35851 GTGGTCAGCT OGATTATGC CACTGCAOCC CAGCTTGGGT GATAGAGTGA 
35901 GAOGCTGTCT TAAACAAACA AAAGAGATTG TAI IGIGI 1 1 TGAAAAACAT 
35951 AG 1 1 1 1 IAGA GAA A TTCTXjT GTTATTCTTC TATTTOCATT ACAG CTTOGT 
36001 TTAAAGAAAC TATGGAAAGC TACAGACTAA OCTXXTTTTC CI MGGM I I 
36051 1 1 I I O GATCT TATAGCAGAG AAOCAGOCAG GGOCAGATTC CTQGGCTTCT 
36101 GTCCrcTGCA GTTGCACAGG GCTCTGCTCA GAAGGGTOOC ATACTTGGCT 
36151 TAATGCTCTG CTCTCAGCAT CTTGAAAATT TTTAATAATT TTATATTTCA 
36201 IC I IG I I I IG TAAGTTATAA GCCCAGTGGG CCAGTGGAGG CIGI ICIAGG 
36251 GGTCTQGGTT TCGCTCATCA GAGGTACATC CI ICJULAGCT CCCCAGGATG 
36301 GGCPCTCAGC AGCTOGCTTT CTCTAGGCTG GCACCCEAGG GGCTTCTTOG 
36351 COXTAGOOC G I GMCA CTG TTGCCCTCAC CTTOGGTGAT GACTGGGTCA 
36401 CATOGGAGAG GAGGAAAGGC A I GMC I G CT GTCAGGCPOC ACCICI IGCT 
36451 GGGTOCTGGG TACAAGCTCA GGGAGGGTTC GGATCAGGGG TCTT7PGACTC 
36501 ACAGIGI IGI GGGGAGTGAT GGTGGTCATT TTATGQCAGG CTGGCAGOGT 
36551 GCTGGCATAT TGATGAGCAG CTGCTGGCAT GA7GAM IGI CCTGAOXTC 
36601 GAG0GGACTC TTjTATCTTGG CTTTAGTCTA GTOCACTTCT CAGCATT0CT 
36651 TOGAAGOGCA GCTAAACTGC AGAGAGAAAG AAGAATATOC TTTOGGATTT 
36701 TAAAGATGGA ACCOCATATG CTATCAATTC GAAAGGAGTT CAGATfCTGG 
36751 CTOGGCTGAA TCACTGATTT TCCAGGCAAG TOGAACTTTA ATGACCCACT 
36801 AGTGGOCTTT GTTCACTCTG GGCTGGCTGG GGAGGAAGGC I IGCCI I IAA 
36851 AGAACGCrer GLICI 1 1 AAA GTTTCCAGGG CAGAATTCAG ATCCTTCAGG 
36901 TAGAGTTCAT CGATCCAGAC CTAGCAGCTG CTTTAGGTAA GGGCICI IGI 
36951 TA G PCI IG I TCTTATOGGG I ICI IATCTT CAGCTCAGCT TOCIGCI IOC 
37001 TGAGAGAGAA GTGGrAGGTT CIGI I MAGG CI 1 1 ICI I I I TTOGTGGTGG 
37051 TGGGAGGTOT CAAAGCGCAT TATTCAGGGG AGTCGGGGTG GTAAAGGAGT 
37101 GGPGAGGAOG ATCAGAAGGG ACCTOCTTAG CCAGGOCAGG AGCTCAGCTC 
37151 CCAGAG0OG CCCTPCAGGC ACTGAAAACT GnTTAGGCT 1 1 ICIGCA7G 
37201 GTTGGAGTTT ACTCTTAGTT GCTGACAGTC ACTCATCOGC TCGAGAGGGG 
37251 TACPGAGCAT AATCACTGTT GGTATTTTAT TTACTACCAG TTCTTCATCA 
37301 TTAG0GAGAC TCATTCGATT TGGAAAATAA ACATTGCCTC AICIGI I CAT 
37351 GGTATATTTT TACACTTGGC TTTCACTCAC ATCTTCCATA TTAACTGAAA 
37401 AGATOVQCAT GVTC1 IGGAG GCTMTCTCA TCTX3GATCTT ATTAGCATCT
37451 ATCTTTTTA T CGTCAGGTCC TCCTTTT3GAC TCTQQOCCAT TCCTCCCACC 



37551 CTTTTOGAGT TTTAI I ILI I CTCTAACOCA CTCTTOCCTT CCCTCTCAAA 
37601 TATAATTCTC TCATCAGCAA CTCTGCCTCA TAATOOCAGT GTATAGGCCA 
37651 TAGCTCAGGT TCTTTTAATG CTTGCAAGGA AOTCTCTTCA GATTACTACA 
37701 GGGTAAAAGA ATCCATTCGC TTTCAACCTC TCATTACTTA CTGAATCAAT 
37751 TATTTATTTA CI PGTCTAAT TCACATCCAT GCTCAGGTAT GTC1 I I A GTT 
37801 TACCTCTGAG ACTCTOGCTC CCTXTAGACA AGGGTCTTTC TCTQOTATTA 
37851 GTOCACTTCC TCAGTCTOA ATTTCATGAG GAGAGAATCA GGGCAAGACC 
37901 TACPGGCTAA GACI IGCIGC TGG CTCAGTT ATTTGOTTTA CTCTTTAAGC 
37951 GCTTCAGGCA GGAGGTGACA GTATC1 1 I VG TGTTCGAAGT CAGATAAGCA 
38001 GTAACAGTAA TATTCTCATG TAGGTCTAAG GGGCAATAGG AGGCTCAATC 
38051 TGAGGGOCTT GGCAGGGATC GTTTOOCTGG GTTACTOGAT TTGAGTCTTT 
38101 GTC1 nOOOTA ATAGTAACTT CTCTGACOCT TGGCTCAAGG AGTXXTTTCT 
38151 AAGCTAAA7G GCCTCCTCAA GAAAG1T3CTG ATATTCATCA GAAAAAAAAG 
38201 TAGGOTTTC1 GTTATCTTAC CATCTGGGAG ACATTGGCTT TAATCCTCTC 
38251 GGCTCTTOT TTCOTTTAA AATTTTCAAG AATCGGCAGA ATTGAGGGTC 
38301 AATCCTOGAT CAGAAGGGGC AGAAAACTCA GCCAGTCAGA GGTGAGAAAG 
38351 GQGTCTTCTO TTCCAGTTCC TCTACTTTAG AGGGCTTGAG GGCCGCATTA 
38401 GGAGCAAAGC AAATACPCTG AGOTGCTAGG AGAAGATTOC TCTAAATCCA 
38451 GGATGATTTC ATCATATGAC TTGAGATOGA GGGGTTCTTC ATTAGTAATG 
38501 GGCTTTTTCA TTGGCAGGAA GTAQGTAAAC TCCAGGCAAT AGGATPGGGC 
38551 ATAGAAATAT AGGGAACPCT ATGCTTTCAA CTC1 PGTGAA GTTAGGAOCA 
38601 AAOCAAATTA TTTGGGGTTC Al IGGIGGCT AAGAAATTAA CTTTTATACT 
38651 TATACTTAAA AAAAAAAAAC TTAAAGAACT TTTATTAATT CTTCGGCAAT 
38701 AGAGTGATTT I 1 1 1 1 I 1 1 1 1 TTAAGAGAGG GAGTCTGGCT CTCTTOCTCA 
38751 GGCTGGAGTC CA GVCGVCC A ATCA7GGCCA GGGCPCACTG CAGGCTTGAC 
38801 CRXXAGGCr CAAGPGACOC TCGCAGCTCA GGCTCCTGAA GTAGCTAGGA 
38851 CTAGAGGGGT GCAOCAOCAC AC30GAGCTAA I II I IGIATT 1 1 I IGCAGAG 
38901 ATCAGATTTT GTCATCTTCC GCAGGATAGT CTOCAACTOC TGGGCPOGGG 
38951 GCATCTGOOC GOCTTGGCCT TCCAAAGTGC TGGGATTAGA GGCATGAGAC 
39001 ACCATOCCCA GCCAGAATGA TTTTOCTTTC TTCAATTCTC AAAATTTTAT 
39051 TCTX5GCTTAG GTAATTCAGT AGGAATOGCA GAGAGTAATA Al 1 1 ICAGTG 
39101 TAATCTTTAG CATTACATAA AGCTCTAGAA AGATGAAGTA AAAAGTTCTT 
39151 TACAGACTTT CTATAGGAAA AATAAATCTT TATAGATCTC CTTATCAACT 

39201 Grrrcnun (xtagctctg catttpcatc tattcataac tctcagagtt 

39251 CAAGGTCAGT TAATTOTACA 1 1 1 PCTGGGA Gil IIICICA CTGTTAATTA 
39301 GTAAGACTTT GAATCATTAA GTG C T G ATCA CTACATGGCA CAATTCATTA 
39351 AAGAGAGTCA TACTTTAAOC TTAGTACAGC TATAATGGGA OGGTCAAATA 
39401 CAACTAGGAA AGTATCTTAT TAAAGTATTT TACATTTTAC TAATAAAATT 
39451 TATTTAAAQC CTGAATTAGT GTTT AI I 1 1 I G1 I IGOGAAT TCTCAGTAAC 
39501 ATCTCACTAG ATOOGTGAAT OCTCGCAAAA GAGCTATTTT AATOCAAAGA 
39551 ICI I 1C1G CA TCTATAATTT AOCACACTAA AGCTCACATT ATCATTAAAA 
39601 GCAGCATTAT CATTAAATTG TAGACATTTA TTACTTCATC TCCAGGGCTA 
39651 ATGGGAGK3G TTTOGAGTAA ACCTTTCTAG AAACAATAAT ATC1 II PGAT 
39701 AAATCGATAT GGGGGPGGAG TGGTTATCAC TTACCTCCTC ATGAAATTTC 
39751 TGGAAAGGTC ATCOGGAAAA CCAGGACACA TTTATCGTAA GATATAACAC 
39801 CTTAOCATAA C3GATTAAGGC TCATTCAAAA ACTCTGACTT AATGAGTTTA 
39851 CCAAAAAACT Gl ICIATOCA CATCTCATCA AACCG000CT GAAATTOCCT 
39901 TGCCTCTGTT AAAI I I I ILI AGACAGTCTG AATAGAGGCA TGTAAI I 1 1 I 
39951 TTGGATTTTT CTCTOGTTAA ATAAATATCC TTTACAACTC TCTTTATTCT 
40001 TCAATATOCA TAAGAGTTTA TATTTATACT GTATCTTTC1 TATTAGGATT 
40051 GCTTTCAATT GCTATATAAA AAATCTAAAG TCIGI 1 1 ACT GGCTTAAAGC 
40101 TTC TO GTCT A 1 1 1 1 IATATA AAGTAACAOC CTTAATTCTA ACTTCGCCAA 
40151 CAGGTAGGAT GGTATTATTA TTATCTTCAT TGTACAGATA AGGAAACTGA 
40201 GGCTCAGATT GACTAGATCA AACAGGAGTT TTCTOGAAAA CCTAGGACAC 
40251 AAGCOAAAT CI I IGAACTC AAATACTOCT CTACACRGAA TTACAGTTAT 
40301 ATACTCATTT CTGTTCTAAA I ICI IAGAGA AGACAGACAT AGAAATTAGT 
40351 AACTTCAGTC AGTAGOGGCT I IU ICAAAC ACAGGGACAT GCATATTTTA 
40401 TGGTATATGT TTATAICTGT GTAATACTCA TCATAAATCT CAGATTTATA 
40451 ATCGATAGTG C3CCAI I ICIA AATTTATAGT TCAAAGCTCT GATAGGAATT 
40501 AGGAATTATC TTAAAGTCCT AGAATAATAA ATATTAAGAT TTnGAAGACT 
40551 GCTTAAAACA GTGTTOCAGG CTGGATTTTT TOCTTACTTT rTTTTTOCTI 
40601 AOTTTTTTAC CTTAGTTTTT TCCATCTAAT GAATGAGGAG CCATTCGGGT 
40651 TOGTCTTTOG TTCTAAAGGG CAGGTTGACT TCACCAGGAG GTOTTTCTTC 
40701 GTATTTATGG ATOCICI 1 1 1 TCACACTAAT CCTTTCATTA GCI ICI ICI I 
40751 ATAGTTCATG CTTCTACAGT TCCAGCTCAA TGGTAAGTAG GTAGAAATAT 



37501 IGIC I I CA TC TGATCICIGC CTGGrnOCAT AGTCACTTCG
40801 GCATTOVCTA ATC1 RGAACT ATATCTAGGA QGGTCATATT CATCCTACTA
40851 CTGAGCACTC TCAGOAGOT GAAGTCTGAG ATTAAAGAGC TTTATTCCTT 
40901 AGCA IGMIC I I 1 1 IG ATCT AGTTAGAGTC ATCTGAGGCA GTOCATOVCT 
40951 TTTAGGATCT TATAATATAA TCCACATACT ACTTTATACT TTTCAATTTC 
41001 TTTTTATATA TTTTTCCTTA CTATAAGGCT GTCAA6TCTA GPGAGATGAC 
41051 ATOOTTATTA ATATATTACA AATGAAGAAC TCTCACTTCG AGOTAGTATA 
41101 GCACTCTGGT TAATAATAGA Ad 1 1 CI TATA I I 1 1 IATATC AATTATATAT 
41151 TATCAAGCAG AGCAGAATAC TAGCTCTAOC AATTAOCTAG CTGAGTCATC 
41201 GTGCAGCAAC TTAATCTCPC AAAGOCCTCG TTTTCTCATT GTAAAATGGA 
41251 GGAGCTTATA GGAJTGTTGG GTAGCTTAAA TGAGATAATA GGTAGAATTA 
41301 AAAGTAAGQG CAAAAAO0QC AATTACTTTT GCAOCAACCT GATATTTAGA 
41351 ACAGTOGAGG CATATAGTAG GCATTCAATA AATATTCGTA GTATTATTTT 
41401 CAGAGGTTAT TACTCATTTA ACTCAACAAA AATTCAGAGC O0LI I ICIAT 
41451 AGGCTAOTCA TTCTGCTAGG GATCAGGAGC ACAGGGCAAA CCTCGTAGGG 
41501 TGCTTACTCT GATOGAATTT ACATTTTACT GGTAGAAATG GTAAATGAGT 
41551 CAACAAGAAT TTTATGGAGT GACAAGGACT TTGAGGAAAG TAAAAOGGGA 
41601 CAATGCTTGA GAAGGGTCCT AGTTGGGAAG TCCTATCACA GGAAGTOGTA 
41651 TTTGATCTGA GACCTGAGTG AOCATAGOCA CATGAATAOC TGGGGGAATC 
41701 ACTTCGCAGG AGGAAACAGA AGCACAGAAG G0GAAAGGGA GGATCACAGG 
41751 OGGCACATTT TATTGGTCCA GTCTCGAGCT AGAGGGACTC GATTGTCTAG 
41801 G GTTTCTA QG GCAGGGTAAA GAGCTTGGAT I ICI I CI GAG TCTAGGATTT 
41851 TGAGCAGGGG AGTGATGTCT CTTATTTTGG GOGGGGGCTA GGOCAGGTAC 
41901 TCTGAAGGAA ATAATOGTAT AGAAGAAATA IWCI I IC1 1 GCTAGAATCT 
41951 TATCPGAGAT ACAACTAG7G GTOGCTCATC AGCAATTAGA ACTGCAGTUT 
42001 TCI I GLIG GT GTACTGCTAA TCTGATCATT GGATCATATT TATCAGGTAT 
42051 TTTTCGVGCA GTCATCTCTA AAGTOGCATT ATGGTTOAG TTATCTATCA 
42101 TTGGATCTTTr TTOCTCTAAC AOTG0GAAGG GGCATGAAGG GTGGAAACAG 
42151 AGTAGAQQGA AATGGGTGAA AAGGCTATTT CTAAAGACTA GACACAAAAT 
42201 AGGGATTAGG AAGTAGTOGG GGTTAGGAGA AAAAGCAAAA TOLIGI Iblb 
42251 CATOGAGAGG GAGATTCTTG AAACTCTGGC I ICI ICAGAT TACATGAGTG 
42301 AAGCTGQjTG GACTGGGTOG ICI IUXTAT TCAGCAAGCT GAATOCAGGC 
42351 CCAATOCAAG ATTACTGCTG CCM IGCICI GAGGCTTTAG AC1GI IAGTA 
42401 GCTTTTCTOG TCTOACFGA GGGCAAAAGG GGCAGTGATA CCCTTGAATT 
42451 I ICI I C I I A A AACAGGGTTT CATTTXXTTG GAAGI I ICI I ICI I ICAATC 
42501 TTTCTGTCAG TTTAA C I U I ATCATOWTT GGTTAGCATT CTAATAATAA 
42551 TTATAATTAT AGTAAACATT TATTCAGTCC TTAGGAAGAG GCAGTTCCAA 
42601 G C II M I1 A T CPOCATTATT CTGCTACTTT GLI ICICATT TTACAGATGA 
42651 GGAAAATC4G GCACAGAGTG GTTAATTAAT CICI I IGAGG TCGGGTAGCA 
42701 GGTCAGTGAT GGCAGGGTTC AAAGCTAGAC TTAACTCTAC ACTAGAGACT 
42751 Gi l I I CI IA A TTAI I I C I l ' C ACAATCATAT GTTTAATCAT TACTTATTCA 
42801 TTATTTAGTG GTCTGATAAG AAGAGGGAGC GGTX3CTCTTC PGTTGGAGAA 
42851 GAAAGGCTGG CRGATCAAGA GAGACTOGTT GGTTTCAAGA AAAAATATAG 
42901 ATGTTAATTC GATAAGACCA CACTCTAAAC Al I ICIACTG GAOGAGTTCC 
42951 AGCTGTGTGC CACTCGAAGT OGGATGCAGT AAGGAAGGCT TTTTATTCAG 
43001 GAGAGAACGA ATAGCICTGT ATTCAAAAGA GAGIGIGI HG TTGCTTATAG 
43051 AAGATOGAAG GGGGCTTGCC AGTGACAGAT TATGATGATT ACCTTjCTTAG 
43101 TOOTTTTTTT TTATTGGACA GACTATAATA ATAATTATAA AAATTTCTAA 
43151 ATCAAGAAAA AICI I IA 0CC CAACAAATTC TTTTTATATT TTTCATATTC 
43201 AT A I M I I I C ATATTCATTT TAI 1 1 I I ICA TATTCAAAAA TAAAACAATT 
43251 TTCATATTCA TTTCAATPGT TTAI I II I IC ATATTCAAAA ATAAAACAAT 
43301 TTTCATATTC ATTTCAATTG TTTTAI 1 1 1 1 TCATAI 1 1 1 1 CATATTTCAT 
43351 AM 1 1 1 1 CAT TTTTATAGAC AGTTATAGAG TATCTACTTA I II I ICATTT 
43401 AACATTATGC TATAACAGGT TTTAAAGTTA GGCAGTAAGT CI I ICCAGTG 
43451 AAAAGGAAAA GACTCAAACA TCAAAAGGTT AOGAGGCATT ACTAATTCAT 
43501 TGGAATCATG CCTAOCAGGC AGCCCATGTC AG I 1 1 IGGAG CTCGCCTTCG 
43551 AAGGAACAGC IGTGTATTTG GAOCTHGGAG CAGICTCTCA GGCTTCCAAG 
43601 AAAGTGGGCA AAGAAGGAAG GGGGGTTGGG GGCACAAGTG GAAAAGTACA 
43651 GTQGTTTGGA ACTCATCTGA AACTATCAAG CTGGTTGAGT TCAGIGI I AG 
43701 AAAATAGGAA AAAGAAAACA Al 1 ICIACCT GATATGGTAC TTTAATTCAG 
43751 GTTAGCACTT CAGTGAATAG GCAATCGATT ATAAI I IGIC AGAATTATCA 
43801 AT A 1 1 I I C I I CI IGICA GTG GCTTTATATT TAATACTTAC ATGTTATAAA 
43851 GCCA GI 1 1 1 A TCAGTGAAAG AAGAAAGTTC TTCAGAAATC CTATTTTAGT 
43901 GATAAAGCTG TTAI ICIGTA TTTAAATTCA GTTTAACTCA AAGTGGTTTT 
43951 TAAGTTTTAC ATTTOTATAA CTCTAGACTA GCCATATGGC ATTCAAAGGC 
44001 CTGCAAGATA TAACTTGAGA CACTtTTAGG ATGGAATQCC ATATTCAGAA 
44051 ICI 1TA CTGA CAGCTCGCCA TATCCAAATT GG C I 1 1 IC I C ICI I U 3GCCT 
44101 CTCGGTGTGA CAAGGGACAA CTTCAGTCAA AGGGCACAGG ACAAAGTAGA 
44151 GATCCTGTTA AATCTCAATC CAGAAGGOCT GVGACAACPG ATAGAGAGAA 
44201 i psi i rcrcr 1 1 ILIOOCTC ATAAAATAAG taaaattcaa agtgaiiiif 
44251 TTTTAATTTT TTCCTTTCAC ACATTGTGTT TAGTCTGATT GAGGTCATCT 
44301 TACACAACAG AATOCAAAAT CCTGAGAAAA TAATTTATAC AACTAGAATT 
44351 CCAAAATTAG Gl I I IGCATT AAGTTCAAGC TATCATTTCT TTAAGAAAAC 
44401 CTACACTGGA GTTACCAACr QGAATATTTC GTTATCAGAC ACAGTCA7CA 
44451 7GTCAGAGTA TCTAG7TTCC TTCTAACACT TATTTCTGTG OTTTAAGAC 
44501 crecroccre ocacacttta tcataacaaa aaacaagaga cacatcttag 
44551 TACTCTAQCA TCTATAGTCA TOGGGAATCT GTTOAAAOG AAATCOCATC 
44601 ACAGTOGAGT CAACACAATA M IC I A GQGG AGAAGGTCTT OCTCTTAAAC 
44651 TCATCTCAAG QCATTCTTCT CTACTATOQG AACTTCTTTA TCTGCGCTCT 
44701 AGAAGACAQC TCAGATOGTC TTCAGTGAAT CIGI ILACTG ACAIGI I GOT 
44751 AGTTTCTTTA TCTTTACATC TAAACAGTGC TTAGAAGCAT CTTTCGCCAA 
44801 G CTSnCm TTTTCATCGA CTOOCCOTT TTQGGGGAGT ATCAOTTCAA 
44851 TCAGCACTOT TCTACTCTCA GCACTAGACA TTCTOTTATC TGTCTCTCGA 
44901 GCTTCCTTCT GGCTTCCTAG CATTAGCTCT GCATTACATC CTOCAGGCTC 
44951 ACCAGGCAGA AGGGGCATTC ALI ILIGLIG CTAOTACOV GCAOCTCTOT 
45001 T TCC P0 G AGT GIGRAOOCT CCCACCACCV GGG0C1TGGA CCTGGGTCCG 
45051 TTCGGCTATT ATOCATOCTC GCAGCTGCCC A1CICIGCIC TCATATAATC 
45101 TACTAGGATA ATAAAGTAGC TTTTCOCTAC CATAAIGI 1 1 TTTAAATAGA 
45151 CATAGAGCAA TCTATAATOG TTAAGAGCAT GATCCAAACT GTGTCTGATT 
45201 TTGAATCCTG GCTGAGTTAC TAGCTATCTA ATCTTCAGAA AACTACTCAA 
45251 TCTCTCTG1G OCTTAATTTT CTTAGGTATA AAGCAGATAC TAATTATCGC 
45301 ATCATAQGOT AGGTATCAGG ATTAAATGAG TCAGTATTTC TAAAACACTT 
45351 AAAACAGTCA CTGACTTOGG CTAOCGCTTT TOGGH000CT C30LI I IGI AT 
45401 GGGAGCTCTG I 1 1 ICACTXT ATTAAATCTT GCAACTTCAC ALI LI ILCAG 
45451 TCTXjTCTTTG TTATCGCTCA AGCTCAGCTT TOGCTOGCTG TCCAGCACTG 
45501 CTOTTTOCre GCATOGGAGA ajCGOOGCPG ACTTCCAGOC CTOGGATCC 
45551 GGCAGGGTCT OCACTOGACC TCPGGTCCAG OGAGGTGGTC aXATTCCGG 
45601 CTGGCAATCG GGCTAGGGGC TTGCCATTCT TCCTGCAGGG GCTAAGTOCC 
45651 CTGGGTTCAT GCTAATTCAG CTGAATAGAG CTGTAACACT CACTCTATCG 
45701 GCCAAGGTTC CAI P0 CT TO G AATCTGTGAG GGGAAGAAGC CCAGGTCAGA 
45751 GAAGAAGAGG CTTCGOGCCA TCTTCGAAGC AGG0CGCCAC CAI LI IGGGA 
45801 GCTXTAAGAA CAAGGAOCOC OOGGTAACAT TTTGGGCAlGC ATCAAGGGAC 
45851 TTCCAAAGCG GTGAGTAATA TCGGAGCACT TTTGCTTGCT GI ILIGCCCT 
45901 ATTCTTCATT GGAATTGGAG GAAAATAGOG GGCACCTGTC AGCTGG1 I AA 
45951 AAAGAATTAG CATGGGCACC AGACTTAAGA CTCAGGTCTG AGGCTGTCI G 
46001 GGGAAGGGCT GTCTAAOVGC O0CCAAO0CT TCTGGGT7GG GAG0GTTCGG 
46051 C I G I CTGGAA OCAGCTTOCA CTTTCAGTTT TCCTGGGGAA GC7GAGGGOC 
46101 GACTAGAGGC AGAAAGCTOT TCTOCTGAAC TCCIGGIGI I AGCTGG1 IGA 
46151 GATCA7GGGG GAGOCAGAAG TCTUTAGTCA ACAGTCAGOC ATCOGTGCAC 
46201 GGCTAOCnT OLI ILIGAOC TATACLILI I GGTCCTGACC ACAACTTCCT 
46251 TCAAAGTCTA GGGGGAAAAT TCTCCTTACC TCTXjAATCTA CI PCCTCCAA 
46301 TDCCTGCCTC CTAGGTACTA ATCGTTCAGA GCTTCATTTC CTCTAGCAAG 
46351 CTCTATCPCC AAAGGGATCT AAGGAAGCTC TATOCTGTCT CCTTAAGGCC 
46401 CTAGGCrcre AAGOCAGATA GTCTTGTO0C TCGTCTOGCT GGCAATTTAG 
46451 GCATACAGCT CTCAACATGG GCAGTTATCT AGGAGLIGI I G00CAOCATC 
46501 CrnGCCAGGG TCCCAGGTTT GTAAAGGGCT AGGAGAAGAG AGAGAGAGAG 
46551 AGAGAGAGAG AGAGAGGGGA GAGAAAGAGA GAGAGAOGAA GAGGGAGTCA 
46601 AAGAGAAAAA GAAAGAGAAA GATAGAAGTA GTAAAGAAAA AATAGTCTGC 
46651 CCTATTOCTr TAAAAGQCAG AGTAAATTTA AAAGCTATAA TTGATAATTC 
46701 AAGGTCTTCT GCTGAGOCTG TAACACTOOA ATAGCACTTT ATTCTCAGTG 
46751 TAAACAAGGG GGTAGOOCAA AAACACTCAG ACCACTGACA ACCTATCAAA 
46801 ATCCTATCAA AAATCCGTAA OOCAGTAACA CGTGGATOGG GCAAAGGCAT 
46851 TCAGTOQGTA GCGGCAACTG CI I IGCTAAA AGTAGAAAAA TAAOTTAGA 
46901 GGAAAOCTCA I IGIGACTAC ACCTCAOCAG TTCAAAACTA TCCTAAGTCA 
46951 AAAAAGCAAA AAGGTAACTT ACTAACTOGA AAATCTTAAA ATATCGGGCT 
47001 CI ICIGI I AG AATAAAGGTA ACTTATTAAC CACTGAAAAT TOOCTTAAOC 
47051 CAGCAGATTT OCTAAGAGGG GATTTAAATC TTAATTAGCA TACAAAGGTC 
47101 GGAGAGATCT AGGAGGAACT OOCTTCAGGA CAGGATGAXA GATCCTTOCT 
47151 GOCAGGTAAT TCAGGGAAAA AAOCACAATG GGTATTTAGT AAT7GATAGG 
47201 GAAALILI IG TAGAAGCAGA GTTAGGAAAA TPGGCTAATA ATTOGTCTGC 
47251 TCAAACATGC GAGCTATTTC CACTCAGGCA AGCCTTAAAG IGI I IACAGA 
47301 ATCAAAAAAA CTCAATCTCA ATOCTGACTC AAAAGGTTAC CTACACCCTC 
47351 TCTGAAATCA ATTTCCATAA GAA L I G I I GI TTATGGGAAT GCAICI I GAT 
47401 GGGGCAACTG GCTTGTTATC AAATACTCAG GAAGCCAGGC CAGCTOCAGG 
47451 ACTCAOCCjCr GAGCAAAAAA GCAAIGI IGG GCACTCTGGT AAAGGAGGAC 
47501 TAGAATOCAG CAGOCTOGAC O0LI MCI II GTGGTCAAGA AAGGCAGGAA 
47551 AAGGGGTGCA GGACTGCTAC ATCAGTCAGT GCAACTAATC 7GATAAGCAG 
47601 AGGTGCATGG GTOGTTA CAC AGGCTGGAAA GGAATAAACA TTAGGAOCAT 
47651 AGAGAACGCT CTAGGXCTAA TGCTCATTCG AAAATGACTA GGGGTGCPGG 
47701 CATOCCTATG ICJL I H I IT C AGATAGGAAA TCTTCGOGGC AAGGCAAAAA 
47751 TCGGCGTAAG ATATA1ICTG GAGAATTCQG AGCAATT7GA CTCTCAGATC 
47801 CTAAGAAAGA AATCACTTAC ATTCCTCTX5C AGTACCACCT GGGGATCATC 
47851 TCCPCTTCAA GGGGGAGAAA CCTGGGCTCC TGAGGGAAGT ATAAATTATA 
47901 ACAOCATCTT ACAACTAGAC CTCTTTTOTA GAAAAGAAGG CAAATGGAGT 
47951 GAAGTOOCAT ATGTACAAAC 1 1 I C — I I 1 1 LA TTAAGAGACA ACTTGCAATT 
48001 ATGTAAAAAG TATCATTTAT GCCCTACAGG AAGCGCTCAG AGTCTAGCTC 
48051 OCTAAOCTCA TCTOCCOCTG ACTOCTTGGC CAACTAATAA GGACCGGCCr 
48101 TTtAAOQCAA ACAGTCCAAA AGGACATAGA CAAAGGAGTA AACAATGAAC 
48151 CAAAGAGTGC CAATATTOOC TGCTTATCCA CCCTCGAAGC GGTGGGAGAA 
48201 GAATTOGGCC OVGGCAGAGT GCATCTAOCT 1 1 I I CI CI O' CACACTTCAA 
48251 GCAAATTAAA ATAGAOCTAG GTAAATTCTC AGATAACXXT GATOGCTATA 
48301 TTG AIbl 1 1 1 ACAAGGATTA GGAGAATOCT TTCATCTGAC ATOGAGAGAT 
48351 ATAATATTAC TGCTAAATCA GAGGCTAAGC TCAAATGAGA GAAGTGCTGC 
48401 CATAACTGGA GGOCGAGAGT TTCGCAATCT CnGGTATCTC AGTCAGGTCA 
48451 ATGATAGGAT GACAAOGGAG GAAAGAGAAC GATTCGDCAC AGGGCAGCAG 
48501 GCAGTTCCCA GTGTAGCPOC TCATTOGGAC ACAGAATCAG AAGATGGAGA 
48551 TTGG TCG QGC AGACATTTCC TAACTTCCOT GCTAGAAGGA CTAAGGAAAA 
48601 CTAQGAAGAA GCCTATGAAT TATTCAATCA TCTXXACTAT AACACAGGGA 
48651 AAGAAAGAAA ATCCTAOCAC CTTTCTGGAG AGACTAAGQG AGGCATTGAC 
48701 AAAGCATATC 7CTCTGTCAC CTGACTCTAT TGAAGGCCAA CTAATCTTAA 
48751 AGGAAAAGTT TATCACTCAG TCAGCK3CAG ATATTAGAAA AAAACTTCCA 
48801 AAGTOGGOCT TAGGOOQQGA GCAAAAGTTA GAAAOOCTAC TCAACTTOGC 
48851 AAOCTGGGTT TTTTATAATA GAGATCAGGA GGAGGAGGCA GAATCGGAGA 
48901 AATCGGATAA AAAAAAAAAG GGCACTGC1 1 TAGTCATGGC CCTCAGGCAA 
48951 GQGGACI I lb GAGGCTOGG AAAAGGGAAA AGCTGGGCAA ATAGAATCGC 
49001 TAATAQGGCT TOG PO CAGT GOGGTCTCAA GGACACTTTA AAAAAGATTA 
49051 TOCAAATAGA AATAAGOCAC TCGLI IGIGC ATGCCOCTTA TATCAAGGGA 
49101 ATCACTGTAA GGOOCACTGC GGCAGGGGAC GTAGGTCCTC TCAGTCAGAA 
49151 GGCACTAAGC AGATCATCCA GCAGCAGGAC TCAGGGTGGC TGGGGCAAGC 
49201 AOCAGGCCAT GCCATCAGGC TCACAGAGOC CTGQGTATCC TTGAOCATTC 
49251 AGQGOCAGGA GGCTAACTOT CICCTGGACA CTCGTUTGGC CTTCTCAGTC 
49301 TTACTCTCCT GTCGOGGACA ACraGCCTCC ATAICIGTCA CTATOOCAGG 
49351 ACAGOCAGTC ACTAGATACT TCTCGCAGOC ACTAAGTTGT GACTGGGGAA 
49401 CTTTACTGTT TTCACATCCT TTTCTAATTG TAGCTGAAAG GCCGACTOGC 
49451 mSTTA SGGA GAGACATTCT AQCAAAAGCA GGGGCCATTA TACAOCTGAA 
49501 CATAGGAGAA GGAACAOGCA I f lb I lb IOC GLIGCIGGAG GAAGGAATTA 
49551 ATOCTGAAGT CTGGGCAACA GAAGGACAAT AOQGATCAGC AAAGAATGOC 
49601 CATCTTCTTC AAGTTAAACT AAAGGATTCT GOCTCCI I IC OCTAGCAAAG 
49651 GCAGTAOQOC CTTAGAOOGG AGGGGCAGCA AGGACTOCAA AAGAI Ibl IA 
49701 AGGAGCTAAA AGCOCAAGGC CTAGTAAAAG CATCCAGTAG CX30CTOCAGT 
49751 ACTCCAACTT TAGGAGTACA GAAACOCAAC AGAOVGTGGA GGTTAGTGCA 
49801 AGATCTCAGG ATTATCAATC AGGOCATTCT COCTCTATAC GCAGCTGTAC 
49851 CTAATOGTA TATTOOGCTT TOQCAAATAC TAGAGGAAGC AAAblbbl 1 1 
49901 ACAGTOCTGG AOCTTAAGGA TOOL I IMIL TCCACTCCTA TAGATCCTGA 
49951 CTCTCAATTC TTCTTTOOCT TTGAAGATCC TTCGAAGCGA ACATCTCAAC 
50001 TCAlGCTGGAC I b l 1 1 I A 00C CAAGGATTCA GGGATAGGOC GCATCTATTT 
50051 GGOCAGGCAT TAGOCCAAGA CTTCAGGCAG TTCTCATAOC TGGATATTCT 
50101 Ib lbUMb G TATGQGGATG ATTTACTTTT AGCOGOCOGT TCAGAAAOCT 
50151 TGTGOGATCA AGGCACGCAA GTGCTCTTAA Al I IOCIOGC CAGCTCTOGC 
50201 TACAAGGTTT GCAAAOCAAA GGCTCAGCTC TCCTCAGAGC AGAGGGCTAT 
50251 TTATCGCTAA ATACTTAGGG CTAAAATTAT CCAAAGGCAC CAGGGGCCTC 
50301 AGTGAGGAAT GTATCCAGCC TATACTGGCT TATOCTTATC GCAAAACCCT 
50351 AAAACAACTA AGAAGCTTOC TTGGCATAAT AGGCATAACA GGCATAACAG 
50401 gi i rcrecro aatatggatt gccaagtaog gcaaaatagc cagaccatta 
50451 TATACACTAA ttaaggaaac tcagaaagcc aatacceatt tagtaagatc 
50501 gacacctgaa ggagaggcag CTTTCCAGGC GGTAAAGAAC aocctaaooc 
50551 AAGOOOCAGT gttaagcttc OCAGGGGGGC AAGACI 1 1 lb 1 1 ICIATCTC 
50601 ACAGAAAAAA TAGGAATAGC TCTAGGAGTC CTTACACAGG TOOGAGGGAC 
50651 CAGCTTGCAA GOCATOGCAT AGCTGAGTAA GGAAATTGAT GTAGTGGCAA 
50701 AGGGTTGGOC TC AI I b l I I A GGGGTAGTGG GGGCAGTAGC AGIU I AGFA 
50751 TCTGAAGCAG TTAAAATAAT ACAAGGAAGA GATCTTACTG TCTGAAGCTC 
50801 TCATCATCTC AAGQGCATAC TCACTGCTAA AGAAGACTOG 7GGCTGTCAG 
50851 AGAA Ub ll l GCTTAAATAT CAGGCTCTAT TACTPGAAGG GOCAGTCCIG 
50901 7GACTGCGCA CTTCTGCAAC TCTTAAOOCA GQGACATTTC TTCCAGAGAA 
50951 TCAAGAAAAG ATAGAACAGA ACTGTCAACA AGTAATTGCT CAAAGCTAOG 
51001 CCGCTTGAGG GGAULI IUA GTOGTTCOCT TOOGATOC CAACCPCAAC 
51051 TTOTATACTC ATCAAAOTTC CI 1 IGIAGAA AAAQGACTTC GAAAAGCAGA 
51101 GTCTOTAGTC GTOGTCATA ATOGAATACT TCAAAGTAAT OGPCTGACTC 
51151 CAGGAACTAG TCCTCAGCTC GCAGAACTAA TAQ00CTCAC TCAGGCACTA 
51201 GAATTAQGAG AAGGAAAAAG GQCAAATATA TATACAGACT CTAACTATCC 
51251 TTACCTAGTC CTCCATQCTC AOQCAGCAAT ATCGAGAGAA AQQGAATTOC 
51301 TAA LI RCTC A GGGAAGGGCT ATCAAACATC AGGAAGGCAT TAAGAAACTA 
51351 TTATTGGCrc CAGAGAAATC TAAAGAAGTG GCAGIU I AC ACIGCTCTAA 
51401 GAAAGGAGAG AGAAATAAAA GGGAAOGGGC GAGTGGATAT TCAAGCGGAA 
51451 AGAGCGACAA GGGGQGAGOC TCCATTAGAA ATCCTTATAG AAGAAGGGCT 
51501 AGTATOGGGT AATO0OCP0C AAGAAAOCAA QOOCCACTAC TCAGAAGAAG 
51551 AAATAGAATC GGAAAOCTCA TCAGGACGTA Gl I ICC I (XT CAGGATCGCT 
51601 AGCCAOCAAA GAAGGAAAAA TACTTTTGQC TCCAGCTAAC CAATGGAAAT 
51651 TACTTAAAAC GCTTCACTTA GGCATTGATA GCAOCCATCA GATCGGCAAA 
51701 TCATTATTTA CTOGAOCAGG OG I I PCAAA ACTA7GAAGC AGATAGTCAG 
51751 AGOCTGTGAA GTCTGCGAAA AAATAATOGC CTGCACTTCA GGCCATGCAT 
51801 TTCAATCCCT GAAILI I IAA GOP0C1 IGI I AAGI I IGICI CTTACAGAAT 
51851 TCAAGCTGTA AAGCTACAAA TOCTTCTTCA AATOGATOOC CAGATCCAGT 
51901 CTATCACTCA AATCTAOOGC GGAGGCTTGG ACCGGOCTOC TAGTCCATCC 
51951 TTOGATCTTG ATCATATCAA AGGCACCOCT CCGGAGGAAA TCFCAAGTGC 
52001 ATGACGCTTA GTPQCAQCAG TTCAGCAGGA AGCAGTTAGA GGGGCCGTTG 
52051 GCCAACCTCC GCAATAGTAC TTCGGI I I PC CTGTTGAGAG GGOTTGCTCA 
52101 GAGACAQGAC TAGCTGGATT TCCTAGGOOG ACTAAGAATC CCTAAGGCTA 
52151 GCTCGGAAGG TCACTOCATC CACXTTTAAA CACGGGGCTT GCAACCTAGC 
52201 TCACAGOCGA CCAATCAGGT AGTAAAGAGA GCTCACTAAA ATGCTAATTA 
52251 GGCAAAAACA GGAAGTAAAG AAATAGGCAA TCATCTATCA GCTCAGAGCA 
52301 OVGGGGGAGG GACAATCATC AGGATATAAA GGCAGGCGTT CTA GOQGG CA 
52351 AOGGCTACCC TCTTTGGGTA GCCPGCCTTT GTATGGGAGC TCTCTTTTCA 
52401 CTCTATTAAA I'd IGCAACT GCACAAAAAC CAAACCAAAC CAAACCAAAC 
52451 AAAGAAAAAA AOVGTGACTG ACTTATCGTA AACATTATAT AAGCATAAAG 
52501 TAAAOCAAAT AC I MIMIC TAATTATAAA AGTCCTACAC TAACATTOCA 
52551 GAAAACTTGA GGAATTCAGA AAAGTTATTA CTTAGTAAGA GTTGGAATGA 
52601 ATAAATAAGT GGGTAOTTAG GATCGCAGGC ATCTG1 1 I IA GGCAGAGAGA 
52651 TACAAGATAA AGAACTAAAA OAGAATCTC GTCTTTGAAC CCCTGGGCTC 
52701 ATTCTC1 1 A T TCATCATGAT GATTTGCCTA MM ICXAAT TTCTAAATCA 
52751 I TQCT C TC C T GTTGACAAAG CAATAAATTG TTATATTTCA TAACTGAATC 
52801 TPCAGAGAAC TCGOCTTCAG CCACCVCTAC AACTAAOCAG CTCTGTGGCC 
52851 CTTTOGAGAA I I ICI IAATA 1 1 PGTAAAGC TCAGOTTGC TACCAGTCAA 
52901 ATCAAGTTAG TGCTCGCTOT GCTCCAGGGT 7GCTGGAAGG ATTTAACAAC 
52951 ATGTATATCT ACAAAGCACT TAGTCCTCTG CTTGG0CTAT I IGGIGCI I I 
53001 MM I M I CI M M M IAAG ACAGGGTCTT GCTTCAATCT TGCTGAGQCT 
53051 GGATTCAAAC TO0GGGGCTC AAGTGATCCT GCnSGCTCAG GCTTCCAAGT 
53101 AGCTGGGACr ACAGGGGPGC AGCACTGTGC CTGGTOGCAG TCCTOGTTGA 
53151 A I G f I C I 1 1 1 TCCTTAGTTC CTTCCTAGCr CI ICIGACAG 1 1 1 IGGGGCT 
53201 TATCTATATA AGAAGGACTT GGTTGCCTCA GGGAGAGAGG ATOCAGTAGA 
53251 GTTACATAGC TCAOGPCACA TCCTOGAAAA GCTCAATTCA TAAGTAAACA 
53301 AAGTGAGCAT TTCACCCATA CTTTAGAGAA AGTCTAGAAT ATTTATCGTC 
53351 TGCATCAGQC TCACATACTC TGAGCI ICIG AGATACTTTT CCCTCTCCAT 
53401 TCGCI I ICCT TC7TXCTGCT GGCI I Mill I I ICI ICI IC I ICM I I 1 1 1 
53451 I I I I M 1 1 I A CTTJPGAAAAA CAACCTATAT ACAGAATAGT ACAAAAACAT 
53501 ACCTCTATAG TTTCAAGAGT AATTATTAAC AGTCTTATTA AGAAACAATC 
53551 CPOCATCCAT GTTACTOCAA AAGACATGAC OTA I ICI 1 1 I ICI ICI IAA 
53601 MM M I CH I I I CI I IC I I TATTTTGGGC CI II I ICA GA TCTAGAGCTG 
53651 CAGAGATCTT Gl I CI I 1 1 I I 7GAGACAGCA TCTOGCTOG TCAGCAGGCT 
53701 GGAGTTCAGT GGGOTGATCT OGGCTCAC7G CAO0CTCTOC CTCCTOGGTT 
53751 CAAGTCATTC C3GCTOGCTCA GCCTGCTGAG TAGCTGGGAC TACAGGOGGG 
53801 TGGCACGACA TOCAGCTAAT 1 1 1 1 1 IATTT TTAGTAGAGA CTGGGI I PCA 
53851 TCAIGI IGGC CAGGATCGTC TCA AICIC I I GACCPQGTGA TCTGGCTOGC 
53901 TOGGGGGCCA AAAGIGCPGG AATTACAGGC ATCAGGGAOC ACGOOCAGOC 
53951 GA1CIIGMC 1 1 ICI IATGA C I G I G I A GTA TTCCATCGTA TATATCTAGC 
54001 ACAI M ICI I TATGCATTCT ATCATTCATG GGCATCTAGG TTCATTOCAT 
54051 GTCTGCTATT GTCAACAGTG CPQCAGTGAA CATPCACGTG GAIGIGICI I 
54101 TCTGGTAGAC TGCTTTATAT TOCTCTGGOT ATATGCTCAG TAATAGGATT 
54151 GCTGGATTGA ATOGTAGTTC I ICI I I IAGC ICI I IGAGGA TACIGCI I IC 
54201 CACAATCGTT GAACTAACTT ACACTCATAC AOTATATAAG CAI ICC I I I I" 
54251 TCTCTACAAG CTOGGCAGCA ICIGI IACTT TTTCACTTAT AGGTGGGAGC 
54301 TAAATCATAA GAACTTATCA ATGTAAAGAA GGAAACAGAC ACTGGGATCT 
54351 ACTTCAGTOG GAAGAGTGGG AAGAGGGAGA GGAGCAAAAA AGATAACTCT 
54401 TOCTTACTAA GCTTAATACT 7GGACAATCT AATAATATCT AGAACAAACC 
54451 ccrcraAOvc atctttatct atctaacaaa ccttcacatg taoocctaaa 
54501 QCTA Al MM TTTAAAAGAA ACAGAATCOC AGCCAGGCAT AGAGGCTAAT 
54551 7G0CTOAAT CGCAGCAC1T TCGGAGGGTG AGATCGGCAG ATCACTTCAG 
54601 GOCAGGAGTT TAAGGGCAGC CCAGGCAACA TAGCAGAAGC CCATCICI IC 
54651 AAAAAGTACA AAAATTAGCT GGGCATCGTG GTGTOCACCT GTAGTCOCAG 
54701 COCCTTGGGA GACTGAGCTC GGAGGA7GGC TTGAGCCCAG GAGGTCAAGG 
54751 CTGCAGTGAG CTUTGATCAT GCCACTOCAC TCCAGCCTOG GCGACACAGC 
54801 AAGAGQCTGT CTCCAAAAAA AAAAAGAAAA GGAAGAGAAT GTTACCAGCC 
54851 CAACTQCACT TCTTCACTOC CCCACCACTC TAATGAAGTC ATCACTAAOC 
54901 CACTTOAAA OTACTOVCAT ACCCTATCTC TATOGAGGTA TCTOVGTGGA 
54951 GGCTAAOGTA TGGCAGTGGA GGCTTATGTA COTATCTCT AAATATTTAA 
55001 AGTTATTAAA TTAAAAAAOC ATTAAAATAT GLI I ICIAOC TTGACAAAGC 
55051 TTTATAACAA AATTAGAAAA TCTTTAATCT TATCGCATTA AATAA TTGAA 
55101 AGCAAAATAT CAAAGATGAT AGAATTTAAT TAATTATTTT ATTTTATTTT 
55151 ATTTCAGAGA GQCTC1 I ICI GTGTCAGCAA GGCTGGAGTG GAGTGA7GCA 
55201 ATCATOCjTTC ACRGCAACCT CAACTTC00G GGCTGGAGTG ATCCTCCXDGC 
55251 CTCAGGCTOC caagtgottc ggactacaga catgtcgcac caaatocagc 
55301 TAATTTTTAA ATTC1 M MA ATAGAGOTAA GGOTCTCACT ATGTTGGCTA 
55351 GGCXAGTCPC GAATTXXAGG GGCTCAAGGG ATCOTTTOC CTTCTCCTOC 
55401 CAGAGIG CTC GGATTTAAGT TCGGAGOCAC TATACGCACC CAACATAATT 
55451 CAATTATTTA ATATTTCACA TCTTTTAGTA I l(JLI 1 1 GAT AGGGATCTGA 
55501 TCTTTGGGTG AATAATAAAG TAAATCAAAG ACATATATTT GAAAATTATC 
55551 TAGITATTCT AAAAAATTAA TTATTTACCT TTATTTTAGC AAAATCAGTG 
55601 TCTTAQCATA ATCAAGATAT TTTOGTATTC TAGTAACAAG ATCTAOTCAC 
55651 ACTAATGATC TAAAGATTAA AAAATAAAAT ATAATAQGAA GCAGTATAAA 
55701 CAAGTQGAAT TTAAATTTAA AATCCAATAC CAGCTGGGTC GGATGCCTCA 
55751 CGCCTCTAAT GCCAGCACTT TGQGAGGGCA AGGCAGQOGG ATCAGCJGGG 
55801 GTCAGGAGTT CAAGAOCAGC CTGAOCAACA TOGAAAAAQC GCATCTO0C 
55851 CAAAAATACA AAATAAGT7G GGIGIGGTCG TCCATG0CTG TAATOOCAGC 
55901 TACTOGGGAG GOGGAGGCAG GAGAATCACT TCAACCAGGG AGGCAGAGGT 
55951 TGTGATCAGC GGAGATGACA GCATTOCACT GCAGTTTOGG CAACAAGGGC 
56001 AAAACTOCT CTCAAAAACA AAAAGAAACA AAAAACAGAG TAOCATTTAC 
56051 ATTAGCAOOC CTCAAAA7GA AATACTTAGG TATAAATGCA GCAAAATAGG 
56101 TATAAGAGAT ATATAAGTAA AACTATAAAG CTOGA7GAA AGAAATAAAA 
56151 GAAGCAAATA AATGGACAGA TATTCGATCT TCATOGATAG GAAGACTCAG 
56201 TAATCTCAAG ATCTCAGTCC TTTTCATCTT TATOATAGA 7TCAGTACAC 
56251 TTCCAATCAA AAT00OQGCT AGTTATTTTG TCGATATTGA CAAACTGATT 
56301 CrAAAGTTTA TCTCGAGAGG CAAAAGACGC AAAATAGGTG ACATGCTATT 
56351 GAAGCAGAAT AAAGTTAAAG TCTCTACTTC I ICI ICAATA TCAI 1 1 ICI I 
56401 CAATATCAAA TCAACTCATT I ICI ICAATA TCAAGTCAAC TCTAGACTGA 
56451 CAGTGTCCTA CTTCAAGACT TACTCTAAAG CTACAGTAAT CAGAACACTA 
56501 TGGTATPGGT GAAAGAATAG ACAAACAGAT CGA7GGAACA GAATAGAGAG 
56551 G0CAGAAATA GAGGCAGACA AATACAGTTA ATCTTTGAGA AAGGAGGAAA 
56601 GGCAATAGAA TGGAGGAAAG ATAOOLI 1 1 I CAACAAATCC TGT7GGAATA 
56651 ACTGGATCTC CACATCTAAA AAAATGAATC TAGACACAGA CCTTAGAOGC 
56701 TTCACAAAAA TCAACACAAA ATOGATCATC AACCTAGACA TGAAACACAA 
56751 AACTATACAA ATTTCTGCPC TCTGAAAGAC AATGTCAAGA GAATCAGAAG 
56801 ACTTGGAAAA TATT7GCAAA AGACACATCT GATAAATGAT TCTTATCTAA 
56851 AATATATAGG AACTCTTAAA AOGAACAAT AAAAAAGAAA CCTGATTTTA 
56901 AAATCTGOCA AAGACTCTAA TAGATATCTC ATCAAAGAAG ATATATAGAT 
56951 GGCAAATAAG CATATGCAAA GATGCTGCAC ATCATAGGTC ATCAGAAAAA 
57001 TGCAAATTAA AATCATGAGA TTCTACTACA CACTTATTAG AATGGGCAAA 
57051 ATCCAAAACA CTGACAAGAC CAAGTGCTGG TGAGGATCTG GGACAACAGG 
57101 AACTTTCATT TAI IGCIGGT GGGAATOCAA AATCGTATAG CGACTTTCGA 
57151 AGACAGTTTC CAGI I ICI IA CAAAATTAAA CACACTCATC ATATGATOCA 
57201 ACAATTGCAG TOCTTGATAT TTAO0CAAAG GAGT7GAAAA CTTATATTCA 
57251 CACAGAAAGC TQCAGATAOT TGCTTAGAGC AGCTTTATTC ATAATTGCCA 
57301 AAACTTTGAA GCAAOGAAGA TCTCCTTCAA TAGGTAAATC AATAAATAAA 
57351 CTGTAGCATA TCCAGAAATG GAGTACTCAG TCCTAAAAAG GAATGAGGTA 
57401 TCAAGCTATG AAAAGAGATC GAGGAAOCTT TATA 1 1 1 MA 1 1 1 II I IAGA 
57451 GACAGGCTCT TGCTCTCTCA TCCAGGCTGG AATGCAGTOG CACAATTATG 
57501 GCnCACTGTT GGCTTGACCr GCTGGGCTTC AGCTCTCTTC CTOOCTCAGC 
57551 CTCGCAAGTA GCTOGGACTA CAGGTGCATC TCACCACAOC 7GGCTAATTT 
57601 ITTTTTTTCA GAGATTOGGT CTTOCTCTGl TOG0CAGGCT Gbl 1 1 IGAAC 
57651 TOCTOGGCTC AAGTGATCTT GCTGCCTCAG CCTOCCGAAG TGCTGGGATT 
57701 ATAA GIG P G A GOCACTATGC CTGACI 1 1 1 1 TTTAAATTTA I I I I ICI ICI 
57751 AGAGACAAGG OCT RQC1 1 1 I TATTG00CAG GCTGGAGTOT AGTGATOCAG 
57801 TCATAGCTCA CVGVCOCCTC AAGATCCTQG GCTCAAGTAG GAGCCCAGCT 
57851 A AI 1 1 1 1 1 1 1 AAAATTTTTG TAGAGATQGA GCCTTGCAAT CTGGGGCATC 
57901 CTATGGAGGA AACTTAAATG CATAI I IUA AGTGAAGAAG GCAGTCTGAA 
57951 AATCTTATAT AOCATOQGAT TTCAACTATA AGACATTCRG GAAATOGCAA 
58001 AAOGAAGGCA ACAATAAAAA GATCAAT7GT CAGGGAGTTG GTCAGGGGAG 
58051 AATCAATAGG TOGAGTAGAG AAGATTTTTA GGGCAGTGAA ATCTCTCTGA 
58101 TACAGTAATG GTOGACACAC GTTGTATATT TCTOCAAATC CATAGATTCT 
58151 ATAATAOCAA CAGTGAAOCT TAATATAAAC TATOGACTIT GGGGCTGGGC 
58201 ATGOTGACTC A I G TOOTAA TCCCAACACT TTCGGAGGTC AAGA7GGGAG 
58251 GATCACTTCA GGCCAGGAGT TTCAAAOCAG GCTGGTCAGC ATTCTGAGGC 
58301 GCTATCTOA GAAAAATAAG GAAAOCATGG ACTTTOGKIT ATAATGATGT 
58351 TTCAGTGTAG GTT0CTO VGT TATAACAAAT GTACTACVCV GGTGGGGGAT 
58401 GTTTATAATA AAG&VGGCAA TCCAIGIU I GGGGCAGGAG GTATAGGAGA 
58451 A AIC IGI I IA OLI I O CTCT A AATTTTACTG TCAATO CAAA ACTOCTCTTA 
58501 AAAAAAAAGG TCTTAAAAAA TAAAATTATA TTTGAGGGAA AATATTTGAA 
58551 TTATTATTAT TATTTTOTT TTGAGAGGGA GTCTUCTCI GTCO0QCAGG 
58601 CTGGAGTGCA GTGGCACGAT TTOQGCTCAC TCCAAGCTCT GGCTCGOGGG 
58651 TTCAOGOCAT TCTOCTGGCT CAGCCTO0CT AGTAGCTCQG ACTACAGGGG 
58701 CAGGCTGCTA TQCQQQGCTA Al 1 1 I I IGTA 1 1 1 1 \AGTAG AGATGGGCTT 
58751 TCACGGTGTT AGCCAGGATG GTCTOGATCT CCTGAGCTCG TGATCTCGOC 
58801 G0CTCAGCXT GGCAAACTGC TOGGATTAGA GGCATCAGOC ACCGGGCCTG 
58851 GCXTATTTGA ATC A1 N 1 IA TCAAAGATCT AAATTAAAAC AAATCTAAAA 
58901 ATAAAAAACA AATCACTCTC TGAT7CTATT TCTATAAATC TCTAGAAAAT 
58951 GCAAACTAAC TTATAATOGC AAACTACTCT ATAGATOVGC ACTTCOCTOG 
59001 AQGCAAGAGG GAAGAATTOC AGTGAGGTAT GATAAAACTT TTOGGGTAAT 
59051 AAATATATTC ATTATCTTGA TTGCAGTGAT GTTCGGACAG GTAGATCCAT 
59101 ATCTCAAGAT TTCTTCTTGA ATACTTTATC TAGTTTATTG CATAACAATT 
59151 CTATAAAATT AAAAATCATA AAATTTTCTT IGI I I IAAAA ACAI ICI I IC 
59201 MICM I Ml CCTGAGAGGG ACTCVCCCVG TATCAOCTAG ACTOGAGTCC 
59251 TGATTCCAGC CTTCAGCTCC TAGGCTCAAG TCATCCTGCA GGCTCAGCTT 
59301 CCTAAGTAGC TGGTACCACA CAGGCGCATC CCAGCACAGC CATATAATTT 
59351 TTAAATTAGT TTTTCTAGAA ATAGGGTCTT GGCATCTTGC GOVGGCTGCT 
59401 CTTCAACTGC TCGGCTCAAG CAATOCTOCA TCTTCGQCTC CCAAAGVQOT 
59451 GAAATTACAG GTGTGAGGCA CTGCAGCTOG CCATCTTAAT TTTTAATATT 
59501 TAAAAGAAAA GTAAGGGCCA GACACTCTGG CTCTOV0CTC TAATCCCAGC 
59551 ATAAGAGCAG CXTOGOTAAC ATCGCAAGAC CXXATCTCTA TCAAAAATTC 
59601 AAAAATTAAC TOGGCATGGT GGTGGCCTGT GGTOGCAGCT ACPGAGGAGG 
59651 CTTJTAGCTOG AGGATCACTT GAGGCTAGCA OGTTGAGGCT GCAATAAGCC 
59701 ATCTTTCCAT CACTOCACTC CAGCTTAGAT GAGAGAGTGA GAGGCTCTCT 
59751 CAAAATAAAT AGATAGATAA TATATGTOGT AGTTTTAAAA ATATATTATT 
59801 AAGAXAAAAA GCAAGGCAAG ACAATTAAOT GGGGGAAGAA TAG I II I IOC 
59851 AACAAATCGT GCTGGAACAA CPGCATAGOC AGAGGAAAAA GAATCAAGTT 
59901 AGATOOGCTA GCTCACAOCA TATAAAAAAA TAACTCAAAA TCGATTAAAG 
59951 ACCTAAGTAT AAGCTGAGAC AAGAAGATTA CTTCAGGCTT GGAGTTCAAG 
60001 AOCAGOCTGG GCAACATATT GAGAGCTCGT CTCTTAAAAA AGA A AAAAA A 
60051 TCAGCOGGGC ATGGCAGTCT CTAOCTCTAT TCCTAGCTAC TCAGAAGGCT 
60101 GAGGGGAGAG GATTOCTPGA GOOCAGGATT TAGAGGCTGC AGTCAGCTAT 
60151 GATTCCAOCA CTOTACTOCA GGCTGGGTGA CAGAGTCAGA CCTTCTCTOC 
60201 TDCACGCCTC CTCCACAAAG TCTAAAGGTA TAAAIGI I IC TOGCCTTOGA 
60251 TTAGGCAATG GTTTATTAAA TATGACATTA AAAGCACAAA CAACAACAAA 
60301 ATAGATTAAT TCGACTTCAT CAAAATTAAA AGCTCTUTOC TTCAAAGGGC 
60351 ACAGCAAGAA AGTCAAAAGA GAATCCACAC AATGGGAGAT AAI 1 1 1 1 IGC 
60401 AAATCATCTA TTTTACAAGA CTGGTGTOCA GAATATATAA AGAAGACTTC 
60451 CAACTCAGCA ATAAAAAGAC AAGTAAGACA ATTTAAAAAT GTTGAAAGGA 
60501 TTTCAATAGA C AM I C I I LA AAGAAGACAT ATAAATCAOC AATCAGCATA 
60551 TCAAAATCTA CTCAAGCTCA T TCG T CA TTA GAGAAATCCA AATAGAAGTC 
60601 ACACCXATTA GGATOGCTAA AATAAAAAAA GATGAAGAAT AACAAATCTT 
60651 GGCAAGTA7G TOGAAAAATT AGAACGCTCA TACACTCTGG ATOGGAATCT 
60701 AAAATOGTGC AGAOVCTTTC GAAAOTTOGC TATTCCTCAG AGATTTAOCA 
60751 GATOGCACAG CAATTCTACT TTTAGGTCTA TAOGGAAGAC AATTAAAAAG 
60801 ATATATAGAG GGGGGGGGCG GTOGCTCAAG CCTOTAATQC CAGCACTTTG 
60851 GCCAAGGTGG GTOGATCAOG AGGTCAGGAG ATQGAGAOCA TCCTOGCTAA 
60901 TACAGTGAAA CCXXATCTCT ACTAAAAATA CAAAAAATTA GCTGGGCGTG 
60951 GTGGGGGGGC GOCTOTAGTC OCAGCTACTC GGGAGGCTGA GGCAGGAGAA 
61001 TC TO G TCA AG OOGGCAGGCG GGGCTPOOAG TCAGCGGAGA TTGGGGCACT 
61051 GGACTOCAGC CTX5GGGAAGA GAGCGAGACT GGGTCTCAAA AAAAAAAAAG 
61101 ATATATACAC AGAAAAACTT GTAGATAGAT GTTCATAGTA GTATTXXAAT 
61151 AAACATCCCC ATCAGTAGAT GAATGGATAA GCAAAAOGTG GTOTATTAAT 
61201 AAATCAAATA TTATOCAGOC ACAAAAAGCA ATCAAGTACT GAXAGATCAT 
61251 CCAATATCGA TOGAGCTTCA AAACTATACT AAATGAAAGA AACCAQCCAC 
61301 AAAAGQOCAC ATAGTACATC ATTACATTTC TATAAAATCT OCAGAATTAG 
61351 CAATTTXATA AAGACAGAAA GTAGATTACT AOTTCQCAAG GGCTG AGGGA 
61401 AQGAGGAATC GGAGTGACTC CTAATOGGTA CAGGGI I 111 TTTTGGGGTC 
61451 AGAAAAOTGT TCTGGAATTA CATAATCATG ATAGTTCTAC AACTTTCTGA 
61501 ATATACTAAG AGACACTGAA TTCTATCTTT TAAAAAGTTA AATTTTATCG 
61551 TATCTGAATC ATAOCTCATT AAAATAGTTA CATGAGAAAA AAATCAAAGG 
61601 CAAAATACAG AGTATAATTC AAGTATTTTA Al II I IAAAT AT AAAGT A7T 
61651 TATAGOCAAA TTTCATTTAC TTCTAAAATG TCTTATTAAA TAGTTTAATA 
61701 AAAQCAAAAC TCTTCTAQCA TTCAGTOTTT ATTAA7TTGC AATACAGAAA 
61751 CAATATCATG TTTTACTCAT GTTGGGTOCA OCACATATCT ATATATTTAA 
61801 GAATAATCTC ATTOGTOVGT ACTGCAAAAT GTTTPOGTGT AOGGTGGCTG 
61851 TCAOTAOCAT ACTAATTAGT ACACTAAGAA CTATGAATTC GAAGAOGAAG 
61901 AAAAGCAAGA AAATCAACAT TCAGCACTAT TTCGAAA7GA AAGCTCACTA 
61951 TGCAAGAATT CATTCCATTC ATGOCTTCTC ABGGGGAGTC TTTCATAAAT 
62001 TAATATTTCA CTAACCTAAG AGCI 1UGCT GCTCAAATCC TCTACACACA 
62051 CTAATOCAGT GTCAGTCTO GAG I I IGGCA GTCGCACAGC CATAACTTTA 
62101 TGGCAGTTAG GTGAAATCGC 1 1 1 1 IGI 1 1 1 GTOGAGATAG AACAACAGAT 
62151 GTQGAGTTGT A L I I C I CIG G GGOCAGCACA AAOGGCATTA AGTCTCTGAG 
62201 TCTCATTCrc TCAGTCCTGA GCATOCATCT Gl ICI 1 1 GAT TTGCGAGTCT 
62251 GIGI I IG I G A TTOGCAGGTT TCTOTACGT GACTCTGAAG GAGTOTTCCT 
62301 GATCACATCA CACOCATCCT TAGGOCTTAT TGATCATCAG TCTACCTTOC 
62351 CACTACTATA CTTTAAATAG ATGOCTGTTA TTTAAATTTG Al I I IGAGAT 
62401 TAAACAGAAT GGCAGAGACA ATI I IGAGAC ACATC1 1 ICI TTCATCCTCT 
62451 GGTAGGAAGA TCAAGATTTC TAGGAGAGTA GAACAGAGXA AAAGATGAOT 
62501 GCTCTTOGGA TOCTATCTTT CTOCTAAGCT ATTTTTTCTC 7CTCAGTTAT 
62551 TCATCATCTA TCTCAATTTT TOCTAA7GAA CICI ICIATA TAAAAGAGGA 
62601 TCCAGGTCGC ACATOCACTG TCAAGGAGGA ATCTAAGATT GACTTOCAAC 
62651 TCAGCCTGTG TAOGCAGTTT ATGGTTTTTC TTOGCAGGTT TAAIGI ICI I 
62701 TCTTATCTCT TAACTTCTTG CTATTCAATA GTAAGTAACT COCTGGCAGA 
62751 ATTACCTGTG GCTAGAGAAT GLIGI IATCA GAGCAICI 1 1 GTTTAATTGG 
62801 TACTTAGAAC AGAAGGTCTC ACXTATTTGA CAGGGCAACA ATTATCAGCA 
62851 AGGAGGCATT TGATTCATCA AGATAGAAAT CTGCCIGI IA GGTGG AAAGA 
62901 TGTCTATCTC GGTTGATA7G 1 1 II IAGAAT ATTAAGGCTT GTTTGTGCAT 
62951 GACAACTTTA GGAAGGTGTA GPGCAAATCT CTCCAAAGGT TTCCTGTAGT 
63001 ICR IACAGA AGTTOGGCTG CTXjCTGGTOG ACAGTGKJTA ACAGTGAACA 
63051 ATGTATGCnC TAGACTOGGT TOOL I ICUC CAGGCTCTGT CTCTGTGGOC 
63101 TTOGGCAAGT I G I I IAACCA AOCACI 1 1 1 1 GGCTCAGTTT CTTTATCGGA 
63151 AGAAGGAGAA TAAGAATACT TCAATCAGGC CAGGCGTGGT GACTCAOGCC 
63201 TCTAATOOCA GGLI I IGGGA GGOOGAGGTG GGTGGATCAC CTGAGGTCAG 
63251 GAGTTOGAGA CCAGGCK3GC GAACATOGTC AAAGGGCATC ICICCI I I AC 
63301 TTATOCTGGC CTGATATTGA TOGRXATGG TAGAATTGAT ACIGU IGAC 
63351 AAAGCAGGCT ATTTCAGPCA GGAGCLI ICI TCTCTAGTTT CCPCTGTAGC 
63401 TATTAGCTTA GGCTTCCATT TC A I I CMCA CACTACAGAT ACTCTCATTG 
63451 ATAAAGGAAT GAIGICI I IA TGCTTTCAAG CAI ICTGGCA AGTTAGTAAT 
63501 TCAACTATGA TTCTAGGTCA GACAAAAOCA GTTATCAACA TAAGACTCTT 
63551 TTTAATCTOC TOOCTGGTGC COCAAGCACC CAGGCCAATC AGGAGAAACT 
63601 ATGTTCTGCA TTOGTTTAAG GAAO0CGCTT CI I ICI I IGA TACCTGACCT 
63651 ACAGATOCAA TCTATTOOCA GGAAI I I IGA TAAGAATTCT CAAATGCTCA 
63701 GCAAGGCTAT GGCACTGTCA TGACTCTOCT ATTGCTGGTA GTCTCATTCT 
63751 CAGTOTAGGC IGI I IGATAG GTA GTTTTC1 GAAGICI IGI TCATCATAAT 
63801 GGATCATATC ATTTTTAAAA GCAGGAGCTG GGTCATATGC CCAGATTAAT 
63851 TTCAAGAAAG TTOOTATCTT I ICI ICCTAA AATTAATTTT TTATGATTAT 
63901 CAAAGTTTTA TATGCATACA GTATAAAATA TCAGATTTXT ATAAGGCTTA 
63951 TAATAAAGAG TAGCATTOCT CTGG0CTATT CCGGPCTCAT ACTGGGCAGC 
64001 TTACPOCAAA ATCAACTGCT TTCAGCTCTT TTAGOCATTT CTT ATOCTT A 
64051 I C I ICATATT TCAAAAGAGC ATGCTTATAT QG1 IGIA TTT TGAAI 1 1 ICA 
64101 AACTTAGATT TTTATCTACA GACI I (JUT A TGGAAGATAA ATATTTAACT 
64151 CI 1 1 1 IGIGC TACILCI 1 1 I CFCATCTCTT AATCTAGGCT ATCTTAAAAT 
64201 TTTTOGTTAA ATCAATTAAA AGCCAGTATA GTATAGTOGC TTAAGAGTGA 
64251 GGGTGCPCAC CTCAAATTCC AGCCTCAOCA CGCATTACTT TCTGTGACTT 
64301 TGAGCAAGCT TTAAAACGTC AGTGTCTCAG 1 1 1 IGICAAC TCAGTAGATA 
64351 CCTCATAGAA I IGCIGI IGA TATTAAGTCA CTTAATOCTA TGGGCTGAAT 
64401 T 1 G TC TC CGA AAGTTCATTT GTTOGAAACT TAATOOCCAG TGCAIGIGI I 
64451 AAGAAGTGGG ACCI I ICAGA GTTGAATAGG CTATGAGGGC TCTGCCTTCA 
64501 TGAATOGATT AATOGGG1 IG TTCCAGTAGT GGCjI ICLI IA TAAAAGGAAG 
64551 IGI I IG AOGC GCTTTCCT7G GCTCCTCATC CATCTCATOG GCTTAGOCAT 
64601 GTTATAATOC AGTAGTAAOG OOCTTAOCAG AGACTGGCTC CTTGATCT7G 
64651 GACTTCTCAG GCTCCAGAAC TGTAAGAAAT AAAAGTTTTT TCTTTATAAA 
64701 TTAOOCAGTC TCTQOTATTC TCTTATQQCA GCAAAAAACA GACTGAGACA 
64751 CTTAATATAT ATGAAGCATC TAGACTCTCT GGCACATTGT ACATTTTAAA 
64801 TGCCAGVTAT CGATATCATC AATATCATCA TCATCATCAT CTGTGGCTGT 
64851 ATAATAOCnC CCTUGCATT TAAAGGATCA GGGCTGGTCT AQCAGTTATT 
64901 AATATAAGTG AGCTAGTTGG CTATAAAOCT CTCOATAQG CTTTOOCATA 
64951 AACTTCTGTC ATGGATAATC AGAGAATTTG GACCTCXTAA TCAOVGGGCA 
65001 CTGACAAGAA AAGGGCAGAG GGAGCTGATT GAGCATCCTC AG! ICI I ICT 
65051 ACCAAAGQCC TCAATCAGAC AG ATTOCI » TOCCAGGAGT AGACACTGAG 
65101 GGGGTGGAAA CAGGCTOGA GTTGACTOCA CTGAGAAGTC CCTGAATGC^ 
65151 TAGCAGGGAG GGAATTAAGG AGTTTCGCTG TGGCAAGKX CTACTOOTAC 
65201 AATATTAGGA CAGCTCTTTT TTTAAI I IGI TCTGTGGCTG CATATTTTTC 
65251 ATCTGCACAA CATAACTACT TACTCACTGT Gl IGU 1 1 1 1 GAAATTTATA 
65301 AGATCTAAAT TTGCAAATCT GAGGTA7GCT CTCAGGAAAA TTATATA7CT 
65351 GTTAAAATTC 1 1 IGLICC I » ' TTAGGACACT TGCATOCATC TGGTGAGCTC 
65401 TAOTAG0CTA CATACTCAGC AACAATTAAA GCTAGTTTCT GQCTTGTTTT 
65451 TCCCAAACAA TAATTTCTTC AAAATACAGA AATT7GGAAA GTACGCCATG 
65501 ATGAGAAACT TTTCAAAGAG AATTTAATCT ACAGT AACTA TTTTTTTCCT 
65551 CTGAGGAATA ATTTTG QGAA AAGAAATTTC CTTTATATTC AGQCATATTC 
65601 AGTAAGTTGC CTTCATTTTC AAACAAATTC ATCACCTOC CI I IO00CAT 
65651 TCCOCAAACT TCTTCATTTT CTTCAI ICI I AAACTCTGCA 1CIGICI ICC 
65701 OCrCACnTA COVCCTAGCT AAATCTCTTA AGAGCCAAOC TATTOCCACT 
65751 CCCmCATCT I I 1 1 IAATCT AATGGATCAT AAGICCIGII GATTTCATCA 
65801 CTCACATCTC TCAAATOGCT CPGCTTGGAT ATCTTGCTAC TTCTCATTTC 
65851 AGACAATTGT CAI I IGICAC TTGCAGCAGC CTTOCAGCTG OCTTCCTCCC 
65901 AATCTQCOCT ATCAGCATTC I C I I CUCA A ACTGTTGGCA AAGAGGAATT 
65951 TCTAAAATAC CGI I IGATCA TCTAATPOCA GGTTTTAAAA AGCTAATGTT 
66001 GACTCTCTCA CATTTAAAGA ATAAAGTO0G TAI I ICI IGG GATCACTGAC 
66051 ATGGCTTPGC TCTGATTCTG GCTACATOCT TAGAGAATCA CTTOCAGATA 
66101 TTAATTTTOC ATATGCTATT OCCTTTACAT GGAATCTTCC 7X50CTCCTCT 
66151 TTOC00CATC CTTTGGCTGA TCAAGGGCTA TTCATOCTOC ATCTCACTTT 
66201 GGGTGTTAGC CTGAGGAAAC A GTGG1 1 ATT TTTTCOCTGC OCATCTOQGC 
66251 TOGATAOCCC TOCTAAGTTC TCCCACAACA CACAGTACAC ACCI ICG ICG 
66301 ATTATCACAC I C I I I CI I CI TOGGCGTTG 1 1 PGTCTCTC TTGAATATGA 
66351 Gl I ICI I GAG ATTAGGGAOC TGAGATCOGC AGTGQOCAGC AGAGCAGGAC 
66401 CIGNICCAC GCCCTCAGTA ACTACVCCFG GGGAAGAGAA GCTACAAATA 
66451 AATCTGTATT GCATGAATAA AGCTATATAT COCCI ICJCIG TTCATACTTA 
66501 TCCATTTAAT I TCTGAACTC TAAATGCTGT Al II I ICICJC ATTAAATTTG 
66551 G l I I I A TTOG TTTCAGTTTC CTCTTTATAT GGAI 1 1 IGAG TCCTAATTTT 
66601 GTCAAOCAGC ATATOWCAG I IC I 1 1 C I A G C I I I I G I C 30C G0GGTAGAAT 
66651 GGCAGCTCCA TGAAGACAGG GATTAGIGIC IGI I MGMC ACICCICI I I 
66701 TCTCAGCATC TAGAATAGTG CTGQCACATA TAGATACTCA CAAAGTATTT 
66751 GTTGAATGAA TCAAGGTGTT GCATACAAAT TTGATAAATA AAACTTAGGT 
66801 I I ICI NCAA ATATTAATOC TTAGCTOCAT ICIGI ICAAT Al I I I IATTA 
66851 GAGATAAGCT TTAAAAATCT TO C I IG I CA C ATAAGAGTTA ATTOCATCAA 
66901 TCTTACAGAA CAAGQCTATA CTGAGAAATT CAAGGAACAC OTAATGAAT 
66951 CAGGCTTATT TCATTCTGAG AGAGTATAGA AATGCTTGAT AGTATGCTGT 
67001 AAGTAATTTT ACTTTACGTG TAAGTACTTT TCTCTGQCTT TCCAGAACAT 
67051 GCTGTTOGAG TAAGAGAGGA ATCCOTATT GTGGAGCGAG GGGATAGATT 
67101 TOGATAGAGC CI I CI I I GA G GAAGOTAGAG AGGTCAACTA TTACATATOC 
67151 AGCAGTAACT TCGCTTTCAA AGACTAAGTG TTOCATTCA ATCATTRGAT 
67201 ACIIMIGIG CATCTACTAC AATTTGAAGA ATAGAAGAGG GAAATGCATG 
67251 TCTAAGGCAT GGTGGCAACA TTTAAGAAAC CCAGATTTGG GGATACAAGG 
67301 TCTGTGTGTG CACATOCATC TGTGTGCAAT TTAAATGCAC AGGGTAAAGA 
67351 ACTTTGAGTC AGTGACAATG AGTCGCAATG GTGGTAATTA AGTGCTGGGG 
67401 AAGATCAAGG GAGAGAGAGG GIGCTGAGAG CCAATAGGGT GGAACATGTT 
67451 TCAAAGAGCT GGGTTCTGAA GTATTCCTCC ATAGAAGGGC ATTTTAAATA 
67501 G C I MIMG C G I CCI IA TTC TGTTAAGCAT TATTAAATTC TTCTG0CATC 
67551 TTTAAAAGGT TCGCTAGCTT AGGTOCACTG GCAAGAAATT ATAAAAGCAG 
67601 CATGAOCAGG CGIC G T G G C T CATGOCTGTA ATOGCAGCAC TTTGGGAGGC 
67651 CAAGGQGGGA GGAI I ICI IG AGTTCAGGAG TTTCAGAGCA GCCTGGGCAA 
67701 CATGGTGAAA CCCTGCCTCT ACAAAAAATA CAAAAATTAG CTGGGAGCAG 
67751 TGGCAOGTOC CTCTAGO00C AGCTACPCAG GAGGCTAAGG TGGGAGGTTG 
67801 GCTTCAGOCT GGGAGGTGGA GGTTGCAGTA AACTCAGATT GTGCCACTAC 
67851 ACACAGCCTG GGCAACAGAG GGAGACCCTG TCTCAAAAAA AAAAAATTAA 
67901 ATTAAAACAA TAAAAGCAGT GICCI ICC IT ATAAGGAGTG ATOGAGTQGA 
67951 GAGAGGAGCT I 1 1 POGGCTA GTCAGGTAAG GCTGGGGTAT GAATCTAGAA 
68001 G l I IOC ATTT AAATAGGAGG GAGCGCTTAG GCAAATCACT TAACTTOGA
68051 GCTTTGCCTG TTTCAOCTCA QOTTCCTTCA AAGATTCAAT GAAAGGCTAT 
68101 ATATAAAGTG CTOCTAAACA TCATCTGGCA TCTGGCAGCT TCTCAAGAAA 
68151 TCTTAOTTTC Q LI I I UCj CT TACAAAGATA AGATCCTTOC TTTGAGTATA 
68201 TTTTTAQQCT TCCTCATCAT TATTGGTTAT GATTTTAAAT CTTCGTCCAG 
68251 GCACCACCCA TATOGT7TCT GCAGTTAACA AAAGAGGCAA AGGTTCACTA 
68301 CTOGGGGAAA AGAQGTCTAC AGAAATGGGT GTAGAGAAAG TAGATCTTTG 
68351 AAGGGGATCT AATTAQGAAA OTA 1 1 1 1 ICC TGGTGGTCCA GAATTTAAAA 
68401 TTATAGAGTC TTATGAGAAA ATGATAATAT TCAQCTTAAG AACA6TTTAT 
68451 TATTTCCTCT CTATATTCGA GAATATTTTC ATTATCTTAT ATAAGAACCT 
68501 TOTAAAAATT TTTCTTCTTA ACTAGCTTOC CACTATAQQC CATTAATOCT 
68551 GTTATTATTT CAAAGATTAA ACAACTATAG TAATAACAGG ATTATATGTC 
68601 CATTAATTTA TATTACTTCA TTCOGCAAAG GATT7GAGGT AGI I I I IAGA 
68651 AGCATAAAAC ACTGGAGAAA TAAAATACTA GGATAGGAGT AGAAAATAAA 
68701 ATTPGAGCAA GCATCAAAAA TATGACATAG TATA 10,1 IGI TAAGACTGGG 
68751 GGAATGTAAA CACATCAOCA GTCAGGGCCT ATATAGI IGI TAGAGTCGCA 
68801 TAGCAAATTT GACICTAAGC TTOGGCAAA CAACATCAAA AGGGAAGCAT 
68851 GATGGATGTG GTTAAATAAG ACACAATTTA CI IGI IACTT TACTCAAGGA 
68901 AGCAAGTATT 1 1 1 IGCO0CT TTGTTTCTTA TAGGAGATGC TCGCTAATGA 
68951 AGTAAIGI IG GCATTGGCCT TATAGTGGAG GTAATAATGG ATTTTATCAG 
69001 GCA STTTCn TAAGCATCTC TTGATCAAAG ATGAGGCTAT GACATCAAGA
69051 GACAATTCTG aggocgggag cagtggctca cacctataat ggcagcactt
69101 1GGGAAGCTG AGGGGGGCAG ATCACTPGAG GTOVGGAGTT GGAGAGCAGC 
69151 CTGGGCAACA TOGAAAGCTC GTOOACTG AAAATACAAA AATCAGAAAC 
69201 CCI G I U I IA CTAAAAATAC AAAAATTAGC TCGGOGTCCT GGCAGTGOCT 
69251 GGAATCTCAG CTACTPQGGA GGCTGAGGCA GGAGAATTGC TPGAAGCCAG 
69301 TAGGTGGAGG 7TGCAGTGAA CTGAAATCAC AOCACTGCAC TCCAGCTTOG 
69351 GTCACAGAAC AAGACTOCAT CTCAAAAAAA AAAAAAAAAA AAAAAGACAA 
69401 TTCTGAAAGT GGTGAGTOGT TCTAOCAGGG GCCAGGATGA TCTCATCTGG 
69451 GTTATOGATT OASG TCTO G CCTCATCTAA GGTGACAGAC AGAGGGTACA 
69501 CCGCACAATC TTCCTATCTT CTTTAAATAT CACIGCM IC AACAGGACTT 
69551 M II IMI II ITTTTCAAAC AGGATCTTCC CCTGCCCTCT CACCCAGGTT 
69601 GGAGGGCAGT GQCATGATCA TGGCTCACTC CAGOCTTGAC CTCOCAGCTC 
69651 AGCCTTCRSA CTAGCTGGGA CAACAGGTGC ATCGCAGGGT GACTGGCTAA 
69701 TTTTAAAAAG I I G I I I CG I I I I I I I I I I I C TCTAGAGACA GGGHPCTCCCT 
69751 AGATTGTOCA GGCIGGTCTC AAACRXTOG C3CTCAAGCAG CTOTCCAGTC 
69801 TTTGCCTCTC AAAGIGI RGG GATTACAGAC GTGAGGCACC AOGGCTOGCC 
69851 CAAAAAAGAT AATAATCTOG TTATTOGAAG AAGTGATTTC CTCTCAAAAC 
69901 ATAAATTCAT TTCTTCTTTT ALILI IGIAA CI I ICIGAGG TGAAAAGTAG 
69951 GAAGTTOXA GAI 1 1 I 1 CAT TGCTCTAAAG AAAGATTATA GAGAAGGAAG 
70001 GAAGGAGTTA GAATAAOCTG TCTGATAATG AATTAGAAGA GTTOGAGGTG 
70051 TATCTAAGTA TOCTCAGCAT GAATTTATCT TTAGCTTAAT GTAGATACAG 
70101 ATOGTTAOVT GTOGAAAGTA TTTATAGCTA TCCATAGATA GGTTOGTATA 
70151 TGTACATGTA 1 1 riClA OCT CCI I IGGCTA AGAGGGOGCA GAAGGCATGA 
70201 CATCCCTGTT GCAACAAAAA CAGCTAGCAC CATTTATCTT GGI I IG I A AT 
70251 ACTATTCTCC AGTAAAAACA AGCAGGGCTC CTTGGAGAAA GGGCTGATGA 
70301 TAATATATAA GATTAGOCTG GAGCATCTTA TATATCAGAA AGCAAGGGAG 
70351 TACTCAAAAA CTAAAAACAA TAAACTGCTC OGCAATAATG GGAGTATCTC 
70401 AAAGGGTCAC AGGAGGCATC TCAAAGAGTT TGCAATAGCC AACAAAATGA 
70451 AGAAGTATTT GAATTTAAAT GAGAGTATAA AATAAATATC TAOGAGPCCA 
70501 TAATGATATA AACAAGTGAT TCAATAAATA AATAAATCTG GGAGACTAGA 
70551 CAAATO00GC ATOCAGAAGA ATTCCAAGTA ATTTATCTAG GTAAATACTT 
70601 CACTGTCAAA GAGGCAGAGC ATAATTAATT GGCCACT0CT GTAAGTCTGG 
70651 GCTQCTTAOT GACI I CCI IC CAGGAGTGCA OTATAAATAG GGAAAAGAAG 
70701 ACAACTPCAC AGTGGAGGAA TCTGGCAAAC IC IG I O JCAC CCAGATCATC 
70751 AAGGTTTACA TCAAAAGCAC TAAGGCATCT TGATACTGCA CTGTGTTTCC 
70801 T7GATATAAT GGGATCAAAT GGCACI I IGI GTAGTOGTOC TTCCAAAAAA 
70851 GCTCATAGCC CTAGACTAAT CATGGGAAGA ACAGCAGACA AATOCTAATT 
70901 GAGGGAGATG CTAGTAATT CCTGAAAAGT ACTCCTCAAT GCTGTCAAGG 
70951 TCATCTGAAA CAAGAAAAOT CTGGCAAACT GTCACAGTCA AGAGGAGACT 
71001 AAGGAGATAT GAGGACTAAA TATAATCTGG T AICI IGI A T GGGAICI IGG 
71051 AATAGAAAAA GGATATTAGT TAAAACTGAG GAAATCTCAG TAAAATATAG 
71101 AIGI I IGI I A CTAATAATCT ATCAATAGTA GTTCATTAAT TGTGACAAAT 
71151 GTAAGATACT ATOGTAAGAT GTTAATAATA GAGGAAACTG GATCTGAGGT 
71201 ATATOGGCAC TOGCTOTACT GTTCACAATT GTTATCTAAA TCTGAAACTA 
71251 TTCTAAAATG AAAGTOTATT TTATTTTATT TT A I I I PSK5 AGGGAGICI T 
71301 G l I C I G I PGC GCAGGCTOTA GTGCAGTCOT GAA AT CTOGG CTCACTGCAA 
71351 GCTOCAGCTC CIGGICICAA GGGATTCTOC TGGCTCAGGC TCCTAAGTAG 
71401 CTGGGATTAC AGGCAGGOGC CAOCACAOCC AQCTMTTTT TATA 1 1 fl IA
71451 GTAGAGATOG GGTTTCACCA TCTTGGGCAA GCTCCTCTTG AACTCCTGAC 
71501 CTOVGGTGAT OCAOCTOCCT OQQCCTOOCA AAGTGCTGAG ATTACAGGOG 
71551 TCAGCCAO0G GG00CAGACA AAAGTTTATT TTTTAAAAQG TAGGAAAGTT 
71601 TCACAi I I RG ATOGTACTAT TCAGATAAAA LI PQOTTC1 I GTTCTTGTTT 
71651 TTCAGATGGA GICI IG I A CT GTCACCTGGA CTGGAGTGCA ATGGCATCAT 
71701 CTTQGCTCAC TOCTAOCTOC ACTT0COQQG TTCAAGOGAT TCTCCTGGCT
71751 cAGceracrc agtagctggg actacaqqca cctgcgacta cgcccagcta
71801 ATTTTTTGTA 1 1 1 1 \ ACT AG AGATCGCTTT CATTAIGI IG GCCAGACTOG
71851 TCTCTAACrc CTGACCTtGT GATCGGCCTA CCTTOQCCrC QQGAAGTGCT
71901 GGGA7TACAG GCA7GAGCCA OOOTQCCTOG CCAAACTTOG IGI 1 1 1 CATC
71951 TGAGAATGAG GTTTCCAAGA ATTGAAGATA CAAGTTAGCT AAATAGTATC 
72001 AGTCAAAGCA CAGAGTATAA CTTCAGAGCA CI IGlGbl I I TAAGGCAGAT 
72051 CTATQOCACA GAGAAGTATT TGAATTTAAA TCAAACTATA AAATAAATAT 
72101 CTATOVGTOC ATAATGATAT AAAGAAGTGA TTGAATAAAT AGATAAATCT 
72151 GGGAGACTAG ACAAATCnOC CATGCAGAAG AATTOCAAAT AGTTTATCTA 
72201 GCTAAATATT TCAGAGTATC AGTACTATGA TOC 1 1 CI I AC AATGCTATAT 
72251 CCGACTAAAA AAAAAAAAAA GAAAAAAAGA AAAAGAACTG TGATCCTAGT 
72301 TQCGAATGTT GACTTATAQC CAGAAAAGAT GACTTTOTA TTTCAAAGCA 
72351 TCATCAAATC CTATCAGCAT GCAGAAATCA GAGGGCATAA CAGAGAGCTA 
72401 TTCTATOGTA 1 1 1 I I A TTTA AIGI 1 1 I AAA AGAGCTATTA ATAAT ATAAT 
72451 GTGAAGAGAA Gl 1 1 IAATTT CAGAGGAATC GGAIGI I ICG TTTTAAGATT 
72501 A l l I I I G CAA AATCTAGTCT GTCATTTTTA AAGTCAACTT TATAATCAAA 
72551 AGAGTTTCAA ATTATTTOOC TATTATTAAT ATTTATTTAT TTTTATTACT 
72601 IIICIG OCAT GACAATTACA GTGCACATTT TOCCTATTTT TAGLI I II I I 
72651 CTGGTAAAGT GATTTTAOCT GCTOTAGATC GTCAGTCOCC AGACI I ICIG 
72701 GCACTAGGGA C I GGI I I CA T GGAGGAGAAT I I 1 1 IGACAA ATTGGAGGGG 
72751 GAATGATTTC AGGATAAAAC IGI ICIATCT CAGATCATCA GGCATTAGAT 
72801 TCTTACAAGG AATCTGCAAC CTAGATCOOC TGCATCCACA OTTCACAATA 
72851 GG G I I I G I G C TOCTATGAGA ATCTAATCCT GIGCIGATOC GAGAGGAGGT 
72901 GGAGCTCAGG TOGTAATCCT OGCTCAOCTA CCACPCAOCT CCIGCIGIOT 
72951 GGOCTGGTTC CTAACAGGCT OGGGGGCTGG GGAOOOCTGC TCTAAATAAC 
73001 CTTTOGAAGA TCTGAAAATT AACTTTAGTA I 1 1 I 1 1 IGIA TTTACTOGAT 
73051 ATTTTAAACA AACAAAAATC TAGAGAATCA CATAAOWVT GTAI 1 1 ICTG 
73101 OGTATOCACT A L I CI 1 1 ACT CAGATOCTAA CATTCTCAGT TTTCAAAAGA 
73151 TCTAGGCTOG TCACAGTCGC TCATGOCTOT AATGTCAAGA CTTTAGGAGG 
73201 CCAAGGAGGA CAGATCADGT GAATGCAGGA GTTAGAGAGC AGGCTGAGCT 
73251 ATAAGGCAAA AC30CIGPCTC TACAAAAATT AGCCAGGTCT GCTGGCCPGC 
73301 AOCTCTAGTT CTAGCTACTC GGGAGGCTGA GGTGGGAGGG TCAATTCAGC 
73351 AGAGGCTATA GTCAGCTGTG ATCCTGCCAC TACACTCCAG TCTAGGTGGC 
73401 AAAGCAAGAC GCTGACTCAA AAAAAAA AAA A A AAAAAAAA AAAAAAAAAG 
73451 ATGTAAACAT TACAGGOOCA GCTGTGCTCC CGTCTACACT TCTOCATTOC 
73501 TAATOOGCTT GATGCACACT TTOOCACAGG TAATCACTAT CCCACATTTG 
73551 CTTTAATTAT TCCCATCATC TTTTATCCTA TGACT ACAAA TCTC TGAATC 
73601 TATATTCCAC TGTATCTTAA TAGCAACATT I I I ICATTTA TTTTAGOGCT 
73651 GATGGGCAAT TTAGGTTATT TOCAGATTTT TACTATTACA GATGCI ICIG 
73701 TCAACATOCT TACTCATTTC TOC I IGIGCA CATCTGCAAA A l I I CIC I G G 
73751 AGTACATAOC TGGGAGTGGA ATTACTAGAA IGIGGIGGAT GTACAIGI IG 
73801 TTGTATTTAT CAGTAGTTCA CTCTTATTTA TTGCTCTGTA TTGCATTAGA 
73851 TAGATATAOC ATA A I I IG I I TATCTAATCA OTATTTATA GACATTTGGG 
73901 1 I CMIC I G I 1 1 1 I I PG ACT ATTAGAAATA AAGCAGCTAT AAACAI I IGI 
73951 ATATAAGTTT TTCTATAGGC ATATGCTTTC Gl I ICICI IA GGCTCAGGGG 
74001 CTOGGAAOOC GCTAGGAATG GAGGAATAGC TCGGTCATAT GGTAGGTTTA 
74051 TGTTTAACTT TTTAAGAAAT TCCTGAACTC CTTTQCAAAG IGI I IGI ACT 
74101 ATTTTATATT GOCAGTAGCA GTATGTGAAA GIICI I GI I G CTCTAGATOC 
74151 TCACCAACAC TTGGTATGAT TAGTCATTTT TAGATATOCT GTCATCTGTC 
74201 TG GI 1 1 1 ICA I IG I 1 1 I A AT TGACAGTTOC CTGATCACTA ATGATATTGG 
74251 GTGTCATAGG CATA1 1 1 ICC ATTTATATAT C I I I I I I G I A AAGTTCAAAT 
74301 III I IG OQCA 1 1 II I A AAAT TGAGI IGCI I GGTTTTTATT GAG I ICIGAG 
74351 A A I I I IG I A T GTAATTTATA TACAAGTTCT TACTAGATAT CTAATTTOCA 
74401 AATCTTCTCT CTCAG1 1 IGI GGCI IGICI I TTTATTCICT TAGCAGTGTC 
74451 TTTCAAAGAG AA GI I CI I A G TIN GATCAA GTTTAGTTTA TCAACATTTT 
74501 C I I GIA CTCA 1 1 I CI I G I A T GGGCATTTAG GM 1 1 1 CATC TTATATCTAA 
74551 GAA AICI I IG CCTAAACCAA GATCACAAAG Al 1 1 1 CI COG GTAI I MCI I 
74601 C I n I G I I I G AGACAGAGTC TCACICIGI I GGOCAGGCTG GAGTGCAGTA 
74651 GTOCAGTCTC AGCTCACTGC AAGCTCCACC TCCPGGGTTC AAGOGATTCT 
74701 IGIGCI ICA G CCPOCTGAAT AGCTOGTATT ACAGGCATCT GCCACCATCC 
74751 TCAGCTAATT TTTCTATTTT TTTAAGTAGA GACAGGGTTT CACCAIGI IG 



FIGURE 3-22 



74801 GGAAGGCTCG ICI ICAATTC CTAACCPCAG GTCATCTGGC CAACTCAGGC 
74851 TOXAAAGPG CTGGIGACAC AGGAXTTTCC TCAGCTACTT TCCCAAGCAG 
74901 GGACTCACTC GQOCATOQQC CAAGGCAGGC CQCTCACTTC GTCCACXTCT 
74951 GCrATAGCTT CTACTCAOCT TCAGOQGTTC CX3GAGCTCTT GTCATCCATC 
75001 TAAAAAGAAG GAGGATATCC TCAGAATTTC AAGTCTAAGG ATGGGTOGAG 
75051 AAGAATTTTA CTGAGTTATC GAACAACTCT CAGCATTAAG GGGACAOGQG 
75101 GTGCTGGCTC AGCGDCACAG TCAGGTOOTT 1 1 ICICICIC TCTCTCTCIG 
75151 TCTCGGTCPG GGG L1 I I I IA TCGACTCAGA ATOGGGAGTC TCTACAGATT 
75201 GGTTTGTGAG TATGCAATAA AAGTTAAAGT GAAGAGAATA TTCAAAGGTG 
75251 GGQGCGATAG TCTAGAAAAC CAATTAQGAA AQQCTAQCTA TCTAGCCTOG 
75301 CATCOTGGCT C3GTOCCTCTA ATOOCAGCAC TTTGQGATQC CAAGGCAGGT 
75351 GGATCACCrc AGATCAGGAG TTTCAAACAA GCCTGACCAA CATCGTGAAA 
75401 CCXXXnCTXT ACTAAAAATA CAAAAATTAG CAGGGTGTOG TOQCACACAC 
75451 CTCTAGTCGC AACTACTTCG TAGACTGAGG CAGAAGAATT GCTGGAAOCT 
75501 GGGAGGCAGA GCTTCGAGTC AGGGAAGATT GGAGGA7GGC ACTVCTGOCT 
75551 GGCTCACATA GOGAGAGAGT GTCTCAAAAA A AA AAAAAAA AAAA AA AAAA 
75601 AGAAAGGGTA GCTATCTGTA AAATAGGTGG AGGGTGGGGA TCAATCAGAG 
75651 GAAAGCATCC CAAATOQGAA GACAGGTTCT CAATCCAGTC CATCGATTTC 
75701 CCPGGGACTT GTAGCTAGGC TTTAAAGTCT CTTCAGCTTC AAGGTCAGGT 
75751 TTCACCAGQG AOCGOTTCCT ACAOCOCTAT CTGGCTAGGC ATTTCTCTCC 
75801 CTCmOCTCC TATCAGTTCT OOCCTCTGAA GAGGTACATC TAACTOQCAT 
75851 TAGAATAGGG ATCATCACTG ATACTAACTC CTTTXTCCTC ACAGQQOTGC 
75901 GGQS re crei TTTOQGAAAA TGGCAGTCAG ATCPOOCKA GAGQCCTATC 
75951 TAAGGGTOGC CAGTAAAAGG GAGGGATOGT CTGAGGCTOC ACTTTCATCA 
76001 CTOGAGTTTA ATOGOCTCAA AATCAGAAGA CAAOCAGATT ATTAGAAGGC 
76051 ATGTATCAAA ACAAAATAAG GGGOTAAGGA CAGCTCAAAA ATOCTGAGGC 
76101 TCOCAACATC GGCAGATAAC AGCTGGCTAT AGTTATOOCT GCTAAGATTT 
76151 GGGTGAATCA GGCTOGGCTT TGGTCAGCTT CI I PGGTC1 1 ATTTTOCCAA 
76201 ACAAAGAAAC CTCTT3GGTTA OQQGCACOCT GTTTACTCCT ATC AGCTO GC 
76251 AGGATTTCCA GGATAATTCC GGAGAACTAG AATATTCATC CAGAI I I MA 
76301 CATCAQGCAT OCCTTTTCTT ICI ICIGAGC TCCAGCTGAT GATCACTGCT 
76351 TQGTKACAG AAATAAGCAG GOTTAGTCTA AAATCCAGAC AAA AACTTA A 
76401 AAAGAACTAA TCAGACTAGA ATTTAATGAA AAGTCTATCA TAAATTTTGA 
76451 AACATAATTT TTOCTCTCC AGTOCTCATT I I ICI IAAAA ACAAATCATG 
76501 ATAGGACTCA GTCATTTCCA GAATAAACTT TAGTCTTATA TTTCGCCTGG 
76551 TTATTTCCAT AAAGCACAGC AAGAATAATT Al I 1 1 ICACA CAGGCTTTTA 
76601 AAATTOGCTT TCATCGAACT CTCTTOCAGA AGGAATTTCA GATAAGACCT 
76651 TTTAAAGCTG AG00CAG0CA TOGG1 I PGTA TOCTCAAATA OCTATCAGTT 
76701 GGGTAAATTC C I C I CI ICI I GAGGTCGGAA GATAACATOG GGTTOCTQGG 
76751 CCTATTAGAA AGTCACATTC TTTATPCACC ACAGATTAGG AACTCPOTAC 
76801 AGGGACTCTC TAGAAGACAA AGTATCAGGC CAGI II ICOC AAGGGGCTTT 
76851 TATTGGTTCT GCAACTCAAA CTTCAI I ICI TAAAGGTAAG CACACOOTC 
76901 CAOTCAAAGC CTTOGTAAAA CAACCAGTTT CTTXAGTTCT GIOUCI IGC 
76951 AAAAGAAAAT GGAIICNAC 7GCACTGATC GAAATAACTC TAI IGCIGCA 
77001 AATTAAGAAT ACTCACAAAT AGI I ICCAAG TTCTCAGGAA AGCAGGCAAA 
77051 AAGAAATAAA TCTOCTOCAA ATTTTCTTCA CTQGAGTATA CCTTACTCAA 
77101 I ICI IAAAAG CTATAGATAG CTCAACATCA AAGTTTOCTT GACTCTCAAA 
77151 AACAAAACAA GGATCAGCAA TCI 1 1 IAAGC AAAAAAAGAT TACTTCAGCT 
77201 TTOATTAGT TCAGTAGATT CTATTAACTC TTCTTC T G CT TCATATTCAT 
77251 GAACATTTCA GCICI I CATC AGPGCIGT A C ATTTTOCCTC TATTOCAATC 
77301 TCACAATCTC CAAAGTTATC AGAAAGCFGC ATTTCAGAGC AOCTGHTCAAA 
77351 GTOOCATAGC TGATTATAAA CCAICI 1 1 IG AAAAGGATCA AAATAAGACA 
77401 ATTCTUTGTC AATCAGAAAA TGTCTTTOGG TAATAACAGT CAAAGCCATG 
77451 ATTCAGAAAG AAATTTCGTT Al I ICIGAGC TTTACAATAA CAAGATAATA 
77501 AMI llllll 1 1 1 1 1 1 1 1 1 I 1 1 I PGAGAQG GAGTCPOGCT CTCTOGGGCA 
77551 GGCPGGAGTG CAGTOGOGGG A TCTOGGCTC ACTGCAAGCX GOGOCTCOOG 
77601 GGTTCACAGC AITCI O CT G C CTCAGCCVCC CAAGTAGCTC GGACTACAGG 
77651 OGCOOGCCAC TAOGQCQGGC TAAI 1 1 1 1 IC T A I 1 1 1 I A GT AGAGAOGGGG 
77701 TTTCAOCGTT TTAGOGGGGA TOGTCTOGAT CTCCTCAOCT GGTGATOGGC 
77751 COGCCTOGGC CTCGCAAAGT GCTCGGATTA GAGGOGTCAG OCAOQGOGCC 
77801 CGGCAACATA ATAATTTTAA TTAGGATTCA TAGCATATAC TCAGACATTA 
77851 GAATTTTAGA AACCTCATAG AAI 1 1 ICGAA CATATCTATT TTTCATTAAA 
77901 ATATAAGCTG AAGAAGATTA AACATTATTT TTATTTTCGC AATOOCACAT 
77951 AACTAAAGAT GTCAGITAAT GCTGTTTACC ICICI 1 1 ICG ATCCTGCAGG 
78001 AGOCCICTCI AGTATTCAAA AGTAAGGGGT CAGAAAAGAC AAGCTTCAAA 
78051 CTOAAGTTTC ATTTTOGGAA GCCICI IAAG TAGATTAGAG GTTTAAAAGA 
78101 CTTTATATTA TCAAATACAA TTCCAGATTA GCATAAGTCA TTTATTTAGC 
78151 CAAAATCATC ACTCAAAAAT TTTTAAAAAG GTAAAAACCT TTACTCATTA 
78201 AGAGPGAAGA CAGCTTTCCA AACAAACAAT GCATCTCP3G TCTCTCGCAC
78251 AOCCTTTATT I I I ICATCAA ATCTTTAGAT AATTCTCTOC AATCTT AAGC 
78301 AGTTnGACCA TCAGGTCAGA TTCTTATAAA GCTTTACAAA TTTTTCTTAA 
78351 AGAGTAGATC AGTGGCTTAA GAAAACCTTC I IU 1 1 1 ATT CTAATCTTCA 
78401 ATTTACAGAA AAAGCATGTA ATACX3CTTTT GAATTTAGTC AATATCTTCA 
78451 CACAOVGAAT I ICI 1 1 IGCA AGATTAATTT TTACAAACCT TCCACAACCT 
78501 TCAACTTTAT CTTATCCAAC TTAAAACAAT TATTTAATOC TCTAAACTAG 
78551 GCAAAAATTT AAATTOOCAT GO G IC I I A T AATOCTTTAC TAAAAACACA 
78601 T7TACTTTCC TTACACACCT TCATCTAAAT CPGTTTTCAG TAGTCTCAAT 
78651 TACATOGTAT AATOGTAAAC OTAGCAATT TTTAATTTTA ATCTAAAGCC 
78701 TGGTAAGTTA TTTTAATTAT GTOCTAOCAA TTATAOCTTA AGTTCTAGOG 
78751 ACTCTOGTCT QCTATTOGTA ATATOGOCTT ACAAAACTGA AAAGCAAGCA 
78801 AGGTCAACAA I 1 1 ICAAAAG CCCAAGAAGC AQQCTOGGTC TOCTGGCTCA 
78851 CACCTCTAAT GGCAGCACTT TCGGAGAGCC AGOCAGGTGG ATCAGCTGAG 
78901 GTCAGGAGTT CAAGACCAGC CTOGCTAACA TGOTGAAAAT CX3GTCTCTAC 
78951 TAAAAAAAAA ATAAAAAAAT TAG I PGGTCA TTATQGTOTA TCCCTOTAOT 
79001 OOCAGCTACT CAGGAGGGTC AGGCAGGAGA ATCACTPGAA CCTCGGAGGC 
79051 AGAGGGAAOC AAGATCATGC GACTOCACTC CATCCTOGAC AACAGAGTGA 
79101 TACTOTTCAA AAAAGAAAAA AGAGAAAAAA GGCAAAGAAG OVGTTTATAA 
79151 CTTTAAAGCA TTTAGTAAAC CTAGTATCTC AGCTCCATAA TTTAGAOCAC 
79201 AIGI I IACAT TTTCAAGAGA I I I GIA TTTT AOCAATAATC TCTAAAACTT 
79251 TTTATTTTTC AAATATTAAA ATCATTTCAA CTAAAAGGTA TTATAGCTTT 
79301 T AU III CX T TCAGAAAATA TTTCATCTAA GIGLI IATTT TTOCTAAGC 
79351 CAATTAATTA GAG CTC1 1 1 1 TTATACAAAC ATCACACATA TTCCACATAT 
79401 ATAACTACAC AGACAGAGGA TOCAGTAOTT GTAAGATTTT TCATTGTOCA 
79451 ATCTOCTAAT TAGATTACTG AOCTCAGGAT GGAGOJU PC AAGAGCAGGG 
79501 CTAGGAAAGC ATCCAGTTTC TAGGGGCTAA TAAATAGTTA TAGCTCGAAG 
79551 ACAAAAACAG ATTTTCAGAG GGATTTATCT GCTTTTAATT CTTTGGGTTT 
79601 CATGAGGAAA ACAGAGGTTT I I 1 ICIAAAA TCGGGTCACT GGTGGCTCTT 
79651 CCATTTTTTC CAGGGAGTCC CAGGGCATCA GAAGTTATCT TAGGGCCTCT 
79701 CATOOGTGCA TTAAGAGAGG CAAGACAAAA TOGAGAAAAG TAATTCAOTT 
79751 GACTGAAAAA GAAAAIU 1 1 TTCCAGTCAA AGAAGATOCA AGAAGAGGAA 
79801 AACATAGAGG CCTTTTAAAT ATGGCTATAG CTTOGATATC CACTTTTAAT 
79851 TAAGCnSACT TTTTACCATA GTOCTCT IAT TTTAAAAAAT CCTTTTAAAT 
79901 COL I IU IAC GCAACTTTAG CCTCACCAAG TGGCCAATAT I ICTGGCTTT 
79951 TCAACTTTAC CAAAAGTAAC CTCACAGCjTG AAAOCAACAA GCCTTAAGTA 
80001 AGOTTCTGAC TTCACTGOCT GTCTACAAGG TATTTTCAAA GAGATCGTAA 
80051 GGA GTTTTTA CAAAATCTAG AATCTTTAAA GATAGCTCAG AGAAAGGAAG 
80101 ATTTAAGAAA GGAAGCTAGA AG1 RG1 I CAT GGAGGGGAAG AGAATCAGCA 
80151 AATCATAAAA GTCACACAGA TATTAGOCAG AAAGTACPCA TTGOCTAAGC 
80201 CAGGATTGAA GCTOGGCTTC CATTCTAAAA TGGCAGAGAC CAAAAGAAAG 
80251 ICJL1 1ULACG TCGTTACAAG GPCAAGCTCC CAAGGAGATA AAACAAGATC 
80301 GAGAGCTTAT TCAGTTTTTT I IOLI ICAGA GAGCTGTAGC AAAGTCTCTA 
80351 ACTGAOCAGT TTGGATGGCT GGCTTCAACA GTCGGCTTAT GGGGTCCTAG 
80401 G TCTCTC T TC TATOCTGTCT TACTCXTTAT GAGAGAACTG TACAGAAAGA 
80451 CACACAAAGC ATAGCAGATT GGCTACAGCT TAAGATTAGC CTCAOVAATC 
80501 CTTTTTTOCA TTAATOVAAA CTTTACAGAA GAATAAACAG TCAI I 1 1 IAT 
80551 CCTTOCTTTT ACTQGTTTOC AGAGGGAGAA AGAGGGCAGA AGTCTGACTG 
80601 GTAAGAACTT TTACICI 1 1 I ACTOGCATCT CAGGCTTCTC GGTTCOCTTC 
80651 CCAGTTCAAT TTTAAGOCAA GCAGTTTAAG GTTTOGGGAT ACTAAACTTT 
80701 TCACAGATAT TTTOCAGTAT GTTTAAATAG TTTTCTTTAG CGCAGATTTA 
80751 ATAATAGTTA TCTUTAGTAG GGTTTGCATG ACXACTTTAT TCCAATCGCA 
80801 CAAACAGAAG TTGATTAGCA AATTATATAT ATATTTOCAT AOGTTTATTT 
80851 TAGATAATTT AAGGCTATTA GAOCAATATT GGTTATTACA GAGCATAGAT 
80901 ACCTTGAAAG AGGGCAAGAT GGAGTGGAAA TCCAGGTTOG AGCTACCAAA 
80951 GGGCAAAAAA CTATTCATCA ACCTTTAACT ACAATAAATC TTOGCTTTOG 
81001 GTATTTTCAT TTCTAAACAC TTTGGGCTTG OVGTCATTTA CATCGGCATA 
81051 ATTACAACAC CTTTOCTTTA TTTGGCATCC AGGAGAATCT TCTTAAATAT 
81101 TAAACCTAAC CACI I IGAGT GATTTCCATC CTGCTTTTCT GATAGATTTA 
81151 GAGGCTTOGG Al 1 1 M IUL AGAATOGAGT AGGAAAGAAA TAGGGTCTCG 
81201 ATAGGGAAAT TCAAAAGCTT CCTCGTATTT TTOOTTCAA AGATTTCTTA 
81251 ACATOGCTTC TTGGATGTCT CTCTCTGATC TCAAACATAC ACAGATATTC 
81301 AAATAAGAGT TATACAAGCA CATC1 RG GAC ATTTTTCGCA TCTATGTCTC 
81351 AAGACACAGG ACATTCTATC TGGTCTTCTG ATGGAAGGAC TTTTGCATCT 
81401 TACTAGACTG AAAATTATTC GAAGGTAGAG AAATCTCTTA 1 1 101 1 1 I IA 
81451 TATTOOCAAC AGOCTAGGAT AGAGGCTAGA ACATTAAAGA ACACTAAAAT 
81501 TTTAATAGTG TAACTGAAAA GCAGGTTAGT 7GGTCACTOC ATCTAGAGTC 
81551 CAATTAACAA GAGCAAGTTC TCATACAAAG AAGTCTATTT ATTTCAAAAC 
81601 TAGCTTAGGG GAAGAGGCAC AAAGCATOCT GCCTTTAAAT GTGCCACTTC
81651 AOL 1 1 PGGAG CAAAAAGTGG GCATTTTTAT AAGGTAGGGG AGGAAA1GAG 
81701 CAAGGGCAAG TCPOCCTCPG CTACTQQQCA AGTATCTGAG CTGGCAGCTT 
81751 CTTCGGCAGA AGTAAG1 PG1 AAAAGTOQCC AAGTGGCTAT GCTTTCAACA 
81801 TOGGCTOCTA GTGGGCATCA GTTCTGAGAX GAOGCTGTGG AGAG1 PCTC1 
81851 GQQQQCATCC TTTOGTCTGC AAATAGACTG TTAALI I ICG AGGAGAGATPC 
81901 CTTOQQQQGA AAATATATAT TAGGAAGTCC TCTUTQQGTG nTTCTAGAA 
81951 GGAGCTAGAG GGACTAGGGC TCGM IGITA TTTATTTATT TATTTATTTA 
82001 rTCreTCTCT GTOTATCTGfc GAGAGAGAG* GAGAGACGAG
82051 ctci pqgpqc gtpgggcagg ggggagtgca gtggcatgat catagcttac
82101 7GCAG0CTCA AACPDCTOGG CTTjCAGGGAG CCIGUGGCT CAGCXTO QCA 
82151 AGAAGCTGGG OCTAL I U PG TCTCCTAOCA TCCCCAGCTA GM I MAAAG 
82201 T A I I I I I 1 1 1 TCTACAGATC GG GTC 1 PSCT TTCTTOCOCA GQCTOGQCTT 
82251 GAACTACTOG CTTCAACTAA TCCTCCTAGC TOVGQCTX30C AAAGTGCTGG 
82301 GA7TATACAT ATCAGCCACT GTTOCVGGTC TAGTTPGCAC TTTTTTTTTT 
82351 TTTCAAACAG AGTOTACTC TCTTCGGCAG GCTQGAATGC AGTGOCACAG 
82401 TCTCAGCTCA CTGGAAACTC CACCTCQCAG ATTCAAGCG^ TTCPCATTAC 
82451 TCAGOCTOOC GAATAGCTGG GATTACAGGC AGCGGGCAGC ACAOOCAGCT 
82501 AATTTTTCTA 1 1 I I IACTAG AGATCGGGTT TOGGGCTGTT GGGCAGGCTC 
82551 GTCTCGAACT GCTOVCCTCA GGTGATCCAC GCAGCTOQGG CTOCCAAGGT 
82601 GCTGGGATTA CAGGTGTGAG CCAACAOQCC TCGOCXJTTTG CAI I 1 1 IAAG 
82651 ATAAAAATTT TACCATOCTG GATATATTCT AGTAGCTATG TACTTCAGTT 
82701 TCTCAATTCT TAAAGGAACT TAATAGAAGT ACTAGCTTAT AGAATTTTGG 
82751 GGATTAAATG AAATAGTCTT CTGAGCACAA CAAATATATT ATCAGCACAA 
82801 GAGCTAGCTC TATTGAGTCT TTATTATTAT AATAG CAGTA ATAGTCAGAC 
82851 TTOGAAGGGG 7GAAAGAGAA CAACAGTCCA TTTTAI II 1 1 GTGGCATATA 
82901 TATCATAQCT CGTAAGAGCT TCGATTCTTT AGATCCXATC TTAAAACTTG 
82951 ACAAAACTAG AAATCTTCTG AGTGTCCAAT AGCAGGTCAT AACTAATGCA 
83001 ATCATTAATT TATTCTTCAA TTTGATCAGA AGACAGACCT AACTTCATCT 
83051 ATTGCjCAATT ACTATTCTAA CAAAATOAT TGGAATTTCA OTTAGGCAC 
83101 TGCAGTACAA CAGTGTGAAT TTCAAAAGTG AGATATTTTA TCTGGCTTTT 
83151 TAAAGTAGGT TTTCAAACCA GTTAAAGQCT CTAAAAGGCA TTAAGAAGG* 
83201 TACTATCGGT CAGGAATAGC ATTTTATGAG GACTCTPGAG AAAAGGGTCC 
83251 I GP GTOOGCr GCA<XATGGT AGAAGTGGAG GTTACTCTCA CTTCTTOCT 
83301 TACTTGCTAT CTLILLI IGI TCTCTCACTT GCTATOGAGT GAGAGGGGCT 
83351 TCAGAGGGAC GAOTAAGAG ATTTAAAGGA TTOQCOOCAG TTGTGGGGAA 
83401 GAGAGGAGCA GGGTCTCACT CTCOGAGACA CGCAAGAAGG AGAGAACTCT 
83451 GGCTGGCTTA CTCTTCOGAC AGAAAGAGCT ATACTGTAAT GGTGTCTCTG 
83501 TTCAATTTCC CTAGACTOCA GCXTTTCTCA GGGCAAAGGA AATCTATTTC 
83551 CTGCAGATTA GAGGTGGGCC TAGGGGAGGA GAAGGCTAGC CTX3GAGITIG 
83601 ALIOITGIOC AAATAGATOC CTGAAAGGGC AGTGAGGTTC CAAOGGTCAG 
83651 PG 1 1 PGAAT GACAGGCAAA AAGGCAGGAG AAGAATGAAT TATCCAAGTC 
83701 AATGGTGCTA TGGOCAGAAG GAACTCTGAG GTGAGACTCT GAATAAGCCA 
83751 TGAAAGTGCA TCTGAGAAAA AAGAATTAGC TTCAAACATC AGCAGACAGG 
83801 PGTTTTO GCA GGAGGGCTCA IGI IGGAATC 7XXTTGGCTT GGTGGOCAG 
83851 CCTOCATCTC OCATCCCTAC CTTCGAAAGA TCOGAGAGCT AGGGTTOGTG 
83901 TAAAGTCGTG GAAGAGAAGA AAAGCAGCTG ATTTCAATGC CTTCGCAGCT 
83951 TOCCTGGGGC TGGGGAAAAG ATTTCATTAC CTAGTGAATA I IGUI I IGI I 
84001 ACXTAAAGTG AQCATAAGCC M ILI I IA AC ATTGAOCAAA AGAATCAAAT 
84051 GGGGC1CTTC AAGIGI I CAT CTAGTGTCAA GGGAAAA7TT TTOCCCACTG 
84101 AATAAATTTT AAGAAGGCAG TCAAGACAAG AAGCTATATT 7GATTATATC 
84151 CTCTTAGTOC TTATTCAATA GAGAGATAAA TCTCTAATTT TTAATATTTC 
84201 GTATAGAAGT AGGTTCAAAT OCACAGTAAT TCACAGAAAC TTCTGCAAGG 
84251 G III IG II I I C I 1 1 1 11 I 1 1 LI I 111 MM MIMMM I TTTGAGACAG 
84301 AATCTCACTC TCTO000CAG GTTCGAGTAC AGTAGGATGA GCPOGGCTCA
84351 croovGocnc GACCTGGCAA GGTTCAAGCA a i ih i g i g c ctcagcctcc
84401 TGAGTAGCTG GGATTACAGG CATCAGOCAA CAOGQGOGGC TAAI 1 1 I IGI 
84451 AM 1 1 1 AGFA GAGAGGAGGT GTCTCCATCT TOGCCAGGCT GGTCTTGATC 
84501 CTCAGCTCAG GTGATCTGOC TGOCTTGATC TOGCAAAGTC CTGGGATTAC 
84551 AGGTGTGAGC CAGCATOOOC GGOCAAGGTC I 1 1 1 MLMG AAAATATCTT 
84601 CACTCATATA AGCACTATAT GCAATATAAG GATATGCTCT TCGGI I ILI I 
84651 GATCTGGTCT AATATTTAGT GTPGGGOOCT TAATTATAAA AGI IGLI I I T 
84701 ATCTAAAAGT AGATCTTAGT TGTCAGGCAA IGI I IG LI GI AAAAAATAAA 
84751 TAAAAAATAA AAGTAGATCT TAGGATATTT ILI I L I A GOC TCCCTAGTAT 
84801 TTATATTAGA I ILM ILI 1 1 TTTTCAGAAA GOGTCTOGCT CAGCTGGGCA 
84851 GGCTOGAGTG CGCAGTGGCG CGATCTCGGC TCACR5CAAC CACCATOCC 
84901 GGGGTTCAAG TGATTCTCGC ATCTCAGGCT CCTGAGTAGC TGGGATTAGA 
84951 GGCAGOCACC AOCAOGQCTG GCTAAI MM CTATTTTAGT AGAGGOGGAG 
85001 TTTCAOCATG TTQGGCAGGC TQOTCTCGAA CICLICACCT CAGGTGATCA
85051 GCGCACCTGG G0CP00CAAA GTGCTAGGAT TACAGATCTG AQCCAOCGCA
85101 ggcagoctag attctttot aaagcatag* tctctgaact rm pgaatg
85151 AAATTAAATG ATTAGAGATT AGTAAAATTT TTCTATAAGA TAGTAGACTA 
85201 ACAAATTGT ACTAGTCTQG GTGTOGTGAC TCATCOCTCT AATCCCAGCA 
85251 ATTTOQGAGG CCAGGGTOGG CCG4CCACTT AAOCQCAGGA GTTTCAGACC 
85301 AGGCTGGGCA ACATGGCGAA AOL" I IGILIC TAGAAAAGAT ACAAAAATTA 
85351 GCTGGTCT G G TCGCACACAC CTCTAGTOCC AGCTACTCAG GAGGCTTAGG 
85401 TCGGAGGATC GCTTCAGOCT AGAAGGCAGA GGTTOCAGTG AGCAAACATT 
85451 GGGTGAGCGC ACTOCAGCCT GGGTGACACA GOGAGACCXT GTOCATTAA 
85501 AAAAAGAAAA TTTAAOCTCT CTGAAGCTCT GCATACTCTA AGGCTCTGCA 
85551 1GIL I I G ATT GGATTCTGCT AATATATTTC GCCAATCAGC TOCTTCCTAC 
85601 TCTCT A CTTr tcaatogctg tcaccaocat CXAAGTCAAG ATGAGAGTOT 
85651 TTACACAGTC TCT7CAT7CT Gl 1 1 IA AGAT TATAGTCI IL TTTOGGTCA 
85701 GGGMGAAAG AAAATAGAAA TATGGCTTAC 7GATTOQG0C A7GQCTTAOG 
85751 CCTGCAATGC CAGCATTTTA GAGGOCAAGG TGGGAGGATT GCTOGAAGCC 
85801 AGGAGTTOVA GAGCAGCTTC AGTAGCAAAG TGAGAGQCTG TCT OTACAA A 
85851 AGflAACACAC ACAAAAGAAA TATGACTGAC TAAAATAGAT ATAAI I I ICA 
85901 TAATACTTTA AAATCTAAGA AGGCAAAAAA I I ILIGGGCT CAAGGTGGGT 
85951 GATGGCTTCA AGCTAGGAGT TGAAGAGCAG CCTGGGGAAC CTGGCAAAAC 
86001 CTTC1 I IL I A CAAAAAGTAC AAAAATTAGC CAGGCATGGT GGTGCACAGC 
86051 TGTGCTTCTA GCTACmGGA AGATTGAGGT GGGAAATTTC CTTCAGCCTG 
86101 GGCTGTO G AG ATCAOVGTGA GCTCAGATTC CAGCACTGCA CT0CAGCCTG 
86151 GGCAG09GAG TCAGACCTTT TCTCAAAAAA AAAAAAAAAA AAAGGCAAAA 
86201 AATTAAATTA TTAGTATOGT AAAGI I ICJbl TTGGACTTAA TATCAAACTC 
86251 ATTTCTAGVA ATGA7GATCA TTTGCATAGG GCTTAACTTC CTmGGTAAG 
86301 AAAATAGAGT AGTATACTAG QVGACTPOCA GAGCPGCATA GAGCTTCAGG 
86351 GPCATCTAJOC AAGACAG4CA ATTTGTTGTC ATCATCAGTG TTAAACTCTA 
86401 AATTATTAAG TGCTTATGTG OCAGAXACTC AAGTTTATAT ACALI I ILIL 
86451 TAATCTTTAA TAATTCTAGA AAGGTATGTG TT7GATCCAT TTTCAAGATA 
86501 AG^AAACTCA TTOVGAGGGG AAGTAACCTG AGCAAGAATA LGI IGIACTT 
86551 GTAGAGCTOG GAATATGACT CAIGILIGI I 1 1 1 P0CATAG CCCAI 1 1 1 LA 
86601 CTTAGGTrOC ATTAGTTTAT TATATATTTT AGAAGG^CTT AWGACTPGA 
86651 rmOCTCAGC GAAGTCAGAT TACTCATTTC TTTGGCACAG AATTATGATT 
86701 ATTTGTATTT AAT A I 1 1 CI I TCCTG1 RGG1 TTTCATTTTT GAGI 1 1 I IAT 
86751 GGAATCTCAC U ' CTO ATTCT CAAI I I II 1 1 A TTC TCA AAT ATAACATAQC 
86801 TTCAGAAAAA AAGATTAAAC ATATCTATGG TTTTATAAAT GATTATAAAA 
86851 TAAATAOGCA GTAACTATTA TCCAGGTCAG CAAATATTCT ACTAGTCTAT 
86901 GAGTCAATTT GCATOGCAAA AGAACTAAGC TTAGGCACTA TACTCAAAAA 
86951 AACTAAAAAT AAAAI 1 1 ML TAAATCTCTA TTATATCAAT GGAA TAAATA 
87001 CAAATATAAC TTAOCATCTC ATAATTOQOC GCAGGCTTTC CG ILI 1 1 IA 
87051 CAGCATOGGT AGGI ILILIC TOCATGGGGA TCA1 1 1 ILI I TTCCTGCGCA 
87101 ATAGTCAGOG I L I PCACAGA GCTATTTOGT TCTOGGAAAA CAGCTGTOGT 
87151 GGGTOCTX3CT GTTGGATTTG TTGGGCTCAT GTCCAGTVCT 1 1 IGIAAGGT 
87201 AAGGACTTOG 1 1 1 1 1 ILATG I IGLI 1 1 1 IA AAAACTCTTA GATAGOTAA 
87251 AGTTTTACTT TCAGAAACTA 7GCTATTTAC AAGCAAAGAT CCPQGTTTC 
87301 AM 1 1 1 1 A AA ACTTTAAGCA ATATGAOTA TAAAAGAAAC 7GTTATCCAT 
87351 AGCAGCAAAT CAGAGCTTCA GAATTTGAAT GLII II 1 1 1 1 CTTCTAATGC 
87401 CTAAGACTTA GAGATCAATC AGGATATATC ILI I ILI HOG TGCATOGAAA 
87451 A A I GI I I L I L CTTAILI 1 1 1 ILI ILIATTC ACATAAAAAT GCTAATGGCA 
87501 CTCAAGTTAT AACAATGATT TATCTAAAAC AGCAGOCTTA GTTTAGGGTC 
87551 AGTTCAGFCT GAGGCGCAjQG GGTTACAATA AGIGGIGI 1 1 TAAAGTAGCT 
87601 GTTCATAGGC TTGATATGAA ATA I I I I IGI TAATGAGAOC AAAACTTTOC 
87651 CATTTATTOC AAOOCAGGTA GAGAATTOCT GTLIGI ILII TAAAAAAAGA 
87701 ACATGCTAAA ATTTTAAAAT ATCA7GGCAA AATGAAGTGG TOCAATCTAC 
87751 CTTAAAATAA AACTTAATCT CAATCTAOT CTXXTCTATC TATTAGAATA 
87801 AGGATCGOCA AO0GCTGTX3C CACAGACTGG TACGGGTCCA TGGC3CTGTTA 
87851 GGAATTOGGC TGCACAGCAG GAGGTGAGCT GTGGGTGAGT AAGCAAAGCT 
87901 TOCTOATAT TTATAGCTGC TOGGCATTAC TTGCATTAGC AGCTGAGCTC 
87951 GAOCT0CTOT CAGATCAGTG GCAGCATTAG Al ILI ILI AG GAGTGCAAAC 
88001 C3CT A TTCT G A ACTGTAOVTA TOVGGGATCT AGGTTGCACT GGGCTTATCA 
88051 GAATCTAAAT GGCTGA7GAT CTGTCACnT CTCGOGTCAC AGGOVGATCT 
88101 GACTGT CTA G TTTCAGGAAA ACAAGCGCAG GATTCGOVCT GATTCTAGAT 
88151 TATCATCAGT TGTATAATTA TTTCATTATA TATTACAATC TAATAATAAT 
88201 AGAAATAAAG TGCAGAATAA ATCTAATGCA CTPGAATCAT GCCTAAGOGC 
88251 GCTCCOCOQG CAGCATCCAT GGAAAAATPC TTCCPGAAAC TGOTCTCTGG 
88301 TGCCAAAAAG ATTOGGGATT CCTOTATTAG AAGAAAAGAC ACAGTGCACT 
88351 GAAGAGGGAC TCACTGTAAG GAACGGATCG GCACATAACT GCATGGAACA 
88401 ICICP G GC TC CAGCTCTAGT TCTXjCTTGTA ACTGGCrTOC GmGGTAGAA
88451 TAATCCTCAG TTAG4CAAAC ACTGATTTAA TATOTAQCTC TCGCTAA CAG 
88501 GAGGTGATTA AGAAGAAAAC CICI IAAGAT GATTTOCATC CI I IGICICI 
88551 ACTTTAGTGG TTTATCTTCA I M L C IG ULI C I I I CC I GIC CCTAGCTCTC 
88601 TTTATOCTGC TTTAGTTGAA AAGCGTTAAT GIGGICATTA AQGAAAAATA 
88651 ACTCAAATTT ACATTTGACT TTTTAI Mil AAATATTTAA TCAACAGAAT 
88701 O G PGG1 1 1 1 ACTCATTCOC GCCCCOCACC CXXCAACACA OATOCCTTCC 
88751 TCAACTCTAA AGTGAGGCTC ATTCTTTCAT ATTTTCTTCC ATCTAATTTA 
88801 GAAATTCTAT TTGGAI 1 1 1 1 AAAAATTATA TTTATTTOT TCTAGAAAAT 
88851 AGAT A1 1 1 1L ATCTTTAAAG TACCTTTATC GGTTTTTTCT TCTAAAATTG 
88901 M ill I A AAG AAAAAAGTTT TATTTOGAAT AAGAT7CTTC TAGGTAATTC 
88951 CATGAGATCA TTTATTTTAG CAGCAACATA ATATTTACAT TATTATTAAT 
89001 GTAATTAATC TTATTAATAC CTCATCAGAT AGCTTOTTC ATCTOGAAGC 
89051 TTOCAGGXAC CTATTCTCAG TAOTCTOGC TCTA0CACTT GCOGAATGTA 
89101 TTACAACTCT AOTTCTOGTA GAGAGGGGAC TCAGAGOTAG ACAAC TTATC 
89151 TAATCTACTA CCTAOTTTCT TAACAAAACA CACATACAAA GC AA1GI 1 1 1 
89201 TCAA A I 1 1 I I CTGACCACTG AGCAATAAAA ATTATCAGAT ATATTTTGAT 
89251 GTGACGCAGT TOGTOCrC 1 1 ICICIACC CTCTAAGTGA AACAAAATTT 
89301 ATTGAAAOCA AAATTCGCTT ACTACATGTA ATATTCI CAT ATATTCTATT 
89351 AAATTTCGTT ATTTAGCTTG CTGATCAAAG GCTACTGAAA CTTCAGAGCA 
89401 AGATACAGGA GCAAGGGGAA ATCTGCTATA GATTC7GAGT GTGAAGTGGC 
89451 AGGTCCATTT TTTOCTCTAG CP0CAGTTCT GGCnOGAG GAAAACGTTC 
89501 TGCAACAACT TAGGTCAATC ACACGCATCT CXOTCTCTC AATCLI I I I I 
89551 GCACATATGA TTCGTATCGG AGAGCCTTAC TCATTTAGAT TCCACTTATT 
89601 TOGCTGGCAA ACGTCACAAA CPGGAACCAT GTGTTACTGA AGGGAAAACC 
89651 TGGAAGTGAA AA GGCT TCAG CAGTAGTGCA AATAOCATCA TAAAGCTCAT 
89701 ATACrnCACT CTGCAGGAGG GAGAAGCTCT GTOGTTTTOC aactcagagc 
89751 ATTAGAGTAC AGTGATAOCA CPGTACAGGA ACTGATCTTC CTCATCATTC 
89801 TGCTGTGAAC AGTAI 1 1 1 IA ATATACACTT TGAAGAAGGC AGAGAGAAAT 
89851 GTATAATAGA CTTAAATTTT I I ICI I IAAA ATTCTTAAAT AAAAACAAAT 
89901 AAGCACTTTA AGTAAGTTAC AATTATCTOG AAAACTACTT AGGTX3GAAAA 
89951 ACTGATAGAG AATGAATGAA GTATTAATTT CIGI II IGI I CTGTGTTATT 
90001 ATTATTTGGG ATAGATG1C1" IGI I ICI I IA AGCAGACTAT GAATAICI IG 
90051 AAGGCAGAAC CACAI I I I I I I I I 1 1 I I IGA GAGAGGGTCT CACTA7TACT 
90101 CAGGCCAGAA TGCAGTGGTC TTATCATAGC TCACTGCAGC CTGGATTCCT 
90151 GGGTTCAAGC OOTQCTOCTG GOOCAGCTTC CTGAGTAGCT AGGACTACAG 
90201 GCATGTGOCA TCAGACGCAG CTAATTTCAG OA 1 1 1 1 1 1 1 I I I I 1 1 IAAA 
90251 TAGAGATCGG GTTTTGCTAT GTTGOCAGAC TCGTCTCAAG GCATCCTGCT 
90301 GGCmQGOCA CCCAAAGTCT TOQGATTACA GGTGTGAGGC AGCAOGTCTG 
90351 GOCAAGGAGC AGATTTTTAA TATTOTTTC CACAATCTAT CTOGTAGACA 
90401 GTAGI IGLI I AATATCTTOG CTAAAGAAAG AGTOGAGATT CAGTAAAGGG 
90451 TGATCAGAGT GAGGTGAGAT TAATTTGQGA AAGGCTAGAA GTGATTCTTG 
90501 AGOCTGATTT GAAQGTGGTG CTAGCTGTGG ATTAATAGAG GGAGAAGGGC 
90551 ATCTCAGAGA GAGGATTOGC AACATGCCTT AATT7TATCA GATTCTAGAG 
90601 I P GG IATCA TTA0CTOVGC AIGI IGCTAG ACTAGCATTA TTATGCAAAA 
90651 TTTTAATTAT TAAGCAACTT TAATCTTACT TTOAACAAA I IGI I IGLI I 
90701 TTACTACTGA TAGGLI 1 1 IC AAAAAACTTT AACTAGI I 1 1 ATTOOTAOC 
90751 ATAAI IGI 1 1 CAAAGAACAT AATCATATCA TCCTTTATCT TCXTAAGAAA 
90801 TCTGGAATTA 1 1 I GGIIA AA CTCTAAGATT ATTTAATOCA TTATTCI I I I 
90851 GACACATOCA TGG0OTACA GCTTAGAAAC TQGGATCACT AAAGGAATAC 
90901 ACTTAATTTA AblCM IUG TAGTCAGAAT ATCAI I ICI I Gl IGICMGC 
90951 AGAATACTGA GAACAGTGCA GTACAGGGCG AAGGI IGGIC TACAG00CTT 
91001 AGGGCAGCAA AAAGAGGCAC AACTGCAOCT CTGTGCAAAT Gl ICLIGACA 
91051 TAAGCTTCGG GAAAAAATAT AAAATGOQGC CI I I ICI I 1 1 ACTAQLI IGI 
91101 TTOGTAAGTA CCTOGAAAAA CTCCATGAAA TAATTAGATT TCATAGTTAA 
91151 TTCTAACTTT TTTAAAAAAT Gl I ICATTGA GACTAGOTTT I IGGI I IGI I 
91201 AATTGAATCA CIGI IGATTT TAGQLI I LCI GGCACCAAOC TTTAI I ICIG 
91251 AGCIG1GGAG AGCACAGTTC TCACTCAGTG CIGIGIGLGI CAGCTGAAAT 
91301 OCACAGAAAG AGGTGGCTGA ACAAAATCAC TCATCACCTT AAIGGI I ATT 
91351 TTTCACATAT TCAGATTAAA TTAAAATAOG TTTAGTGCTA CATOCTTGAC 
91401 TTACTGAGTT I I P 0 G CTCTA 1 1 I I GG I IA A M I N I UM I I I IGGI I A A 
91451 CM I IACTTC TAGAAAATAT GTTGATGAAC AAAAAOGCAC TTATACTATA 
91501 AGATTTTATT CTAOCAAQCA CACAGTAACA ATATTGAAAG CTGC1 1 1 CCA 
91551 ICI 1 1 1 1 C AT CTTTATACAG TTCGATOGAG OCTCTCTAOC TTAGCTATCG 
91601 AATCATATTT GOCTGCGQCT GLILCI I IGC ATAGCAGGCT TCAT7GGTCA 
91651 1 1 T I G QGACA CTATTTCAAG AAGOGCCTTG GACTGGTGAA TOGCATTCTC 
91701 AC TO CT GG CA GCAGIGICI I CACAATOCTG CTGOCTTPQC TCTTAAGGGT 
91751 TCTGATTGAC AGOGTGGGOC ICI 1 1 IACAC AT7GAGGGTG CTCTOCATCT 
91801 TCATC1 1 RSI rOCTTTCTC GCTOGCTTTA CTTACGGACC TCTTGCTAOC 
91851 AGTAGCAAAG ATAAAGAGAG TCGAGGTAGC GG ATCCTG OC TCI 1 1 POGAG 
91901 GAAAAAGTTC AGTOCTOCAA AAAAAATTTT CAATTTTGGC ATCTTCAAGG 
91951 TCACAGCTTA TCCAG IG IG G GGAGTTGGAA TAOCACTTCC ACI 1 1 I PGGA 
92001 TACTTTCTGC CTTAIGI ICA CTTGGTGAGT ATGGP0CTTC ACTGATCATG 
92051 AATATTACTA TTTAATAAAG AAAAACTTCT TTGAAGAGAA AGTTAGGTCG 
92101 AOTTAAACTT GGCCTCAAAC ATTATCCTOG TTGTAATTTT GGTATTCT7G 
92151 AAATGAAAGG TCTCTC A AGA CAATCTCAGC AC ATOCAT TA GACCACTAAA 
92201 OVGAGAGAGT A I CI I ICATA GTCTOCTTTC GTATTTTAAA AAOOCTGCAA 
92251 AO0CAG0CAG ACAOCATCGT GOCTCTCTAT GGTO3CAGCT ACTAAGCTGA 
92301 GGCAGGAGGA TCACTTCAGC GCAGGAGTTC GAATCCAGOC TAGACAACAT 
92351 AGAGAGACTC TAOCTCTAAA AATAAAATAA ATCTOOOCAA AGAAAGACAA 
92401 PGTTTTTTAA CAGGAAGGCT AAAATAGTOG AACAAATTAC AATCACTATA 
92451 AAAGATTTGA TAGGTCTCTT 1 1 I CI I CA TA TCGCTTTTAT GAGGGACAAA 
92501 GCTAGOGCTA TC ATTTTO CT ACCATAACTA AAI IGI 1 1 1 1 CAACX3GAAGG 
92551 GTGTAGGTAA TTAGCAAAAA AGCCATGA7G TTGAT/VCAAA GAAACATTAC 
92601 ATCTA L I IGI GGTACACTTC TOGGAAAA7G GGAATTOAT TCAGAG GAAT 
92651 ATCTGAGAAA AGTTACTCAA GATCTAAATG AGGAAAGAGA ACTATGGTTT 
92701 TATAGGAAAT TAGGATTTCA AOTGCTCAAG AAGTTTATAT TCTTTATTTC 
92751 TATTTCAAAG GCAAAATTCA GLI 1 101 1 AT ACTGAAATAC GAATAATTAA 
92801 7GTCTAGACT GGGGTTOGTC CCTCACGOCr GTAATCOCAC CACTTTCGGG 
92851 GGCTGATGCA GGAGTTCAAG ACGAGOGTGG GCAAGATAAG GAGACTTCAT 
92901 OCCTAOCTCG GGAAGGAAAA AAAAAAAAGA AGGAAGAAGC AGTCTCTAAA 
92951 GTATCTOOCC CTGGCAACGT I IGI ICAAAA GIGI ICATTA IGI I ICI HOC 
93001 M I NIUM TCTQGCTGAA AATCTATTTA CAATTCACGG TAAATCATAA 
93051 AAATGGCATT GGCACACATA I I IGTATCTT TCTCAACTTG GAM I I II IC 
93101 TAQCTTACAG TCTALI 1 1 IG GAGATTTCTG CAATTTTTCT TTAGTTAAGA 
93151 AATAAGTATA AATATAACCG ATTTAQGGAC TATCAGGCTA CATOCTGATC 
93201 TCATAGTOCA 1 1 I ICATACT ATTAGGAAAG TATAGOOGAA CCAACTTAAG 
93251 GTAAGTTTCC TGGAATATAG ATCTCTTCTG ACAGGATTAA CT7TAGCATC 
93301 GAAOCrCTTI CATAGCTPCr GTAGTCAAGA GAACATTTAT IGIGICCI 1 1 
93351 OTAAAAAGA TCAGTAGAAA I I CI I M ICI I 1 1 I J I ICI I I 1 1 ICCAGAC 
93401 AQGGTC1 IGI TAAGI IGCIC AGGCTOGCTT GAAGCAAGOC TOCTCGCTCA 
93451 GCTAGGATTA CAGGTGCAAG OCACCACACC CAGCTTTAAA AAAAAAATTC 
93501 rCTTTOGTAC TAOCACATCA ACACAGCTAG AGAAATCATA ACTCAGCTTT 
93551 GCTAATACTA GAGATTTAOC AAAGGAAAAG TOOTAGATCA CTGICIAGTT 
93601 ATTTTTCGTT ATATATTTAT AATTTGTAAA TTAATTTCAC ATATATTACT 
93651 TCATTTGACT TTCACAATAA AGCAGTAAAG CAGATAAAAT AAATATTAGC 
93701 TOCAATTTTA CAGACTGAAA AACAGATCTA I IGI IAATAG AGACGTTAAG 
93751 TGATTTTOCA AGAATTACAT GTCAGTAAAC AGCAGAGCAG GAGTTAGTTC 
93801 IC I l O CACre TGCTTAGCTG GTAGCAAAAT CAGTCTACAG TCTTAATAGC 
93851 ATATTGGGGC ACTTOOCTGG ATATATTAOC AAATCTCTOC ATCTTATTAG 
93901 GGGAAAAATG AGTATGGCTA AGGAAATTTA ATAAGCATCT TAI I ICI ICA 
93951 GCTAAATAAA ATTTCATAGT GGAAGGTCAG TTAGAGAATC TTATAGATAC 
94001 1 1 1 IGICATC AGGAGATOGC AAATCAGATG GTGCAlCAGAA CAATAAAGTC 
94051 ICIGI IAATT CIGI IAATAA AGCATCGCTT 1 1 1 ICIGCI I TCCCI ICI IC 
94101 OVQGCATCTT TTCTTAGAAA ATAIGI IGAC ATTGTTCATT TGAGAI I I IC 
94151 ICI 1 1 CI CAT AAGGGTGCGC GTTATCGCAC CGAA7GCAGC AOGGTAGAGG 
94201 AAAGATCAGA TAGCTAAATC GCATACAGGT GTTTAAATCT GCTCTTTOCT 
94251 TATCTACPGA G l I IGICA CT I I G I IGIA AT TTAAGGTTTG AATTATGGAT 
94301 ACTTAAGCAG GAATGGGAGA CTAOTTTOCT OCTTATAlCAG GGAAAAGGTG 
94351 TCTCATATOC TTCAAAAGAC TAGTAAAGTA GATGAIGI IC AATTCCTACT 
94401 AAAGOCTTTA TTGACIGI IG AGGGGAGACA TATATGAGAC GTAAAAATTT 
94451 GCTCTGAAGG AGCATAAAOC TAGTACATCT AATTAAAAAT GGCTACAGTT 
94501 TATAAAGCAC TTTTAGATAC Al I C I C I I A T TTAATATTCA CAACAATGCA 
94551 GTAOCTCTGG TCTATCCTCT TTATTTCATC GAAGGGAAGA CTAAGGGCOG 
94601 GAAAGATTAA ATAACTTGCT CAGGCAGGCA CAGTOGCTCA OGGCTATAAT 
94651 CTCAOGACTT TGGGAGAATC AAGTOGAAGG ATCACTTGAG OOOVGCAGTT 
94701 CAAGAOCAGC TTGAGCAACA TAGTGAGATT C3CATCTCTAC AAAAAGTAAT 
94751 TTAAAAAAAT TAICIGGGCA TCGTCGTOCA TGGCTCTGGT OOCAGCTACT 
94801 TCGGAGGCTC GGGTOGAAGG ATCGCATCAG GGCAGGAGGT CAAGGCTGCA 
94851 GTGAGCCATG ATGGTGQGAC TGCACTOCAG CCTGGGTGAC TCAGTAAGAC 
94901 OCTATCTCTA AAAAAAATTA TAAAGTATTC TAAAGGAAGA AGAGATTGAA 
94951 CAATTTTTAA TTTATTTGTC TCCTOCTOCT AGTGGGAGOC TTTTAAATAT 
95001 GGAAGGTGAA GAAATAAAGA GGCAGATCTG GTGGTACACA TOOTAGTCC 
95051 TAACTACTCA GGAGGCTCAG GCAGGAGGAT TGOPGGAGGC CAGGAGTTCA 
95101 AQGCTCTGGT GTXXTATCAT TCTGOCAOG GACGOCAGGC TGGGTAACAG 
95151 AGGAAGACrC IGICTCTAAA AAAGAGATAA TAAATAAAGA AGTAACTTGC 
95201 TTGAGGTCAC AGAGATAGTG ACTGATAATT ATTACTCTAG TACTTTTATC 
95251 TAAGAGGCAG TATTGTATAG TCGTTTAAAA GTGAAGGTTC TQQGOCTOOT 
95301 GGGGTGGCPC AOOOCTCTAA IKTCAQCACT TTGGGAGGGC AAGGCAGGTG 
95351 TATCAOCAGA GGTCAGGAAT TTCTGACCAG OCTGGGCAAC ATGGTGAAAC 
95401 GCTGTOOA CTAAAAGTAC AAAAATTAGC TCGAGGTGAT TOCTTCCACC 
95451 TCTAATCTCA GCTACTCAGG AGGCTGAGGC AGGAGAATCG CrTGAACTTG 
95501 GGAGGCAGAG GTTGCAGTGA GCTGAGAG0G OGCCATTCCA CPCGAGOCTG 
95551 GCTGACAAGA GCGAAACTCC ATCTCAAAAA AAAAAAAAAA AAAAGTTCTG 
95601 AAGTAAGACA GATCTGGATT TAAATTCAGG 1 1 MGI I ICI TACTAGTTGC 
95651 ATAACCTTGG GGATCCTOG TAAGCATCAG TTTCCTCATC TATOGAGATA 
95701 AAGCCAATTT TOCAGAGTTC TCAGGATTAG ATAAAATCTA TCTGAAACAT 
95751 CTACCTCAGT TCTGGCATAA AAATGQGAOT TATTTTAATC TAAGGCAATG 
95801 TGATTGGCAA CTTCAGATAG AAGTAAATTT TGAAAGGAGA AAGATAATAC 
95851 CCATTTGGAA AAGIGGI 1 1 1 AAAAAG1TTC ATAGCATTOG AGTTOGGCCT 
95901 TGAGCATCAG ATTTTCTOTA CAAA7CTGAT CI I PGATCAA CTAGGGAACT 
95951 AACTTACCAG TTTAGGTCTT TCAAGATTCA GAAATACAAT GGAGTGCTCT 
96001 CATTGCTATG TTAAAAATTC TAAGATCTTA TTAGATTCTA CA TGATG ATT 
96051 TGAGAGAGAA TATGTATGCT TGCTTTCAAA GTGAGCTTGG AGGI 1 1 CATC 
96101 TTCTC G TAGT TGAOGTTTCA AAAAGAAGAA TTAGATTGGC TOCTCGAAGC 
96151 TAAATTTAOC I I 111 I MA G GCCTTCGCAC TTAAAATCTT TTTTAGAAGG 
96201 ATACAAATCT TATAGATCAA TTTAGATGAG GCXTAAOTT CTAAAAAOGA 
96251 TRjCTAGTAG CAGCTGCATC AG I 1 1 I IATC AATTOOOCCT TTPGOCTOAG 
96301 AGTTCTTTTG TTTTGTTTTC TCGAAICI I I TTTTCTTT7G TTTTGTTTTG 
96351 C I I I G II II I Gi l HUGH I I 1 I 1 1 1 G AGA GGGAGTC1 IG CTCTGTCT0C 
96401 CAGGCTGGAG TCCAGTGGTC CAATCCOGGC TCACTGCAAC CTCTACTTCC 
96451 OGGATTCAAG TGATTCTCCT GGCTCAACXT (XCTAGYAGZ TOGGATTACA 
96501 GGCGOCTGOC ACCACAGCTG ACTTAATTTT TTGTAI 1 1 I I AGTAGAGACA 
96551 GG GITTTO OC ACATTGGOCA GGCTGGTO0C GAACIGCIGA OCTCAGGTGA 
96601 TCCAG0CATC TTGGCOC3CC AAAATGCTGG GATTAOGGGT GTGAG OCACC 
96651 AOGCCTGGOC TCTGGGTTTC 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 C_ 1 1 1 I I 
96701 AAGGGCTGCT CTGACPOCTC TCATTTAGCT TTCAGGAGGA TAAACTCTCT 
96751 TO GTTTTCTC CCTACCVOCA CATCACTTXT GCTTAGTTTC TTTGGnCACT 
96801 nCTTCTTTn GCCACTGACC GCTGAATATXZ AGCATCTQCT AGGGU IGIC 
96851 CCCTGATOT TTTCTOCATC TATTCTACTG GTOGTTTCAT CCAGTCTCCT 
96901 AAGTTCATAC ATCAOGTATA TCTCAATGAC TTCAAATTTA TAATTCTGGT 
96951 GCAGAOCTTT TCOCTGAATC CTOCAOCAGA GCTOTATATC CAGUGCI IA 
97001 CTTAACATCT GCAC1 RGGG1 AACPGCTAQG IGI I PCAGAC TTAGOCTCTC 
97051 TAACGCTGAG GILI IGATCT TAO0CCTTAA AACTTACTCT GOOOGCAGCC 
97101 ATOCTCATCT CAGGAGCTGG OAATTCGGOC CTTTCAGTTG ATCAGACTCA 
97151 AAACTTTGGA GTCATOCTTC GCICI ICI I I CTTOCACAOC ATAGTCCTGA 
97201 TOCAGTGAAG AAATOCTOGT GGLI I I ICI I TCAAAATATA TOCAGGATCT 
97251 GAGCACOCT CAGCATCCTC ACTACTCATA CCTTAGCOCA GGCTAOCAOG 
97301 TAG00CTAGC CTGGATCACT GQCAGAGGCT OCTAACTGGT CICICIGliC 
97351 CI IC r CTO OC O0G0GGAGTT IGI ICICIAT GAAGAAGOCA GAGGCATTCT 
97401 TTCTAAACAT AAGTCACRT GCTOVGAATC CTTCAATOGC TTCOCATTTC 
97451 GCTAAGAGTA AAAAOCAATA TOCTTACAGT GAGCTACAAG GTOLI ICACA 
97501 ATCTGGOGQC CACTAGCTCT CGGAGCTTCC ATOGCTGTOC CTTQQOCACT 
97551 CTGCTTCTGC CATTOGOGT TTAATOGGGC TCACTCTGAC TACCTGCTTG 
97601 AAA CI I CC I G CGTOOCTTTT CXX3CPGAGTA TTCACAAAGC GGPO CTAGTA 
97651 C I CC I II I C I 1 1 1 1 1 IGI AG CAOTAATAC TT7CTAAGAT TATCTATTTT 
97701 ACI ICI 1 1 AT TGTAGTCATT GCTTACTATC GGTATATTTA CACGTCTGCT 
97751 AGAATCTAAA GAOGACAAGG GTAAGGATCT ATTTCATTCA GTOGTAGATC 
97801 CCAAGCATCT AGCACAGTGC CTAGCACACA CTGGGTGCTC AAATAI I IGI 
97851 TGAATCACTA AATATATTCT GGGTGAGTCT GAAGTGACAC TCTATAAGTA 
97901 AIGI ICATTT TTTCATCATT TGG A I CI I IA AAATPCTOA CTTTCATGCT 
97951 ATAATGATTT TTCACATTCT GTACTTGCAG GACATOGTGT TATTAATATT 
98001 TATTCAATAC TTATTCAACA AATAAGCTCA AACTAAGGAA ACCTOGGAAT 
98051 AATTCAGTAA GCAGTAATOC TGTOOGTTGA TGGAGGAGAG AG I IGblG'I G 
98101 I M I GUCIG ATTCACTTAT GGCTTTCOTG AAATTTTAAG ATAAATAGAA 
98151 GAA A I IIC I G GTOOCTCAAG TAACTCTOTC TTCAGTAOOC ACTGAAAAAT 
98201 CTCAAAGACT CTGGAGTGGT GIGI I IAAGA ATAGGATGCA GGATOCAGAA 
98251 CCATAACCAG GCCTCAGGTC TGCATAGCTT TGGTOGAGCA TTCAGCATAG 
98301 GGGCTOGT G A GATAACTGAT AAATGCCAAA TATGACAATG ATAAATGCGA 
98351 AATATGAGAA TGATAAATGC CGAAGAATCA CAGTGACAAT GATAATGAAG 
98401 TTAGCAAAAA TGATGGTAAC I I MCICATT GGCATGAAAT GCTCXATCTC 
98451 GAATCTGAAG CTGATGATCT AGI I ICAGTT ACTCTCATCT CTCTCCGCTG 
98501 CTACTCAGAT TGAAAATCAG CTACTTAGTA G C I G I GI I C I TTGACTCTAG 
98551 AGCATATCAT TGGGTCAAAT TTCAGI I 1 1 I AAATTTTAGA TOCACATOGT 
98601 iC I C I G T CA A GAAGATGA CT GACPCATATT GAAATCTGTA AAATATCTAT 
98651 TCATTAGGCT GTTTTTTAAA AACTOOCTTA TAAGTGGGTT GACI I IGPGG 
98701 CAGATACTAA TTCACTCTTC TCAAAAGAAA CTTTGAGCTG GTAGGAAGAT 
98751 O0CATTTAOC TCATCCTATC GTTCAAGAGA GVCAGATCAT TTGCTTGCTA 
98801 GCAGQQCAAT TAGGTGAACT TCAAGTGCAC TAGTAATTGG AAATGATTTT 
98851 ITTTTTTTTT TCAGACTGAG TCTCATTCTG TOGCCCAGGC 7GGAGTGCAG 
98901 CQGCATCCTC TCGQCnCACT QCAAOCTTCA CXTCCTOQOT TCAAGCGATT 
98951 CIICI G CCT C AGCCTGOOGA OTAGCTOQGA TTACAGGTAC CTCOCAOCAC 
99001 GGCTGACTAA I I I 1 101 ATT TTTQOTAGAG ATGGGTTTCA GCATCTTGGC 
99051 CAGGTTGGTC TCAAACTCCT GAGCTCAGGT GATCTGTCTG CCTGGGGCTC 
99101 CCAAAGTGCT QQGTTATAGG CATTAAGGAC OQOOCCPQQC CATGAATTGT 
99151 AM I I IAAAC CAGAAATGAA AATTTGAGAC TAATAAGTCA GTAGAGGGAG 
99201 CATCTAAACC TGGAAAGGTA I 1 1 I i I AGO" TTGAGTAGTG CCAGATGCTG 
99251 OCAAGGGTnCA ATCAACACTG GAATGTAGCT ATTAGAGCTT GCTAGGCAGA 
99301 GCAOCTCCAT TTACACTUTG GTCAGAGGAG GAGTACTGCT GCAAGGCAGA 
99351 GCTAAGGGGG CTCAGCOVOG CAAATAGGAA CAGCATACAA GOCTTCATCT 
99401 CIC1G IG GCT TOCTCAGAGG GAGATTCA7G TAACATTTGC CAAGAATTCA 
99451 TT A 1 GIGTCA AGCACTTGGC CAAAATCTCA CAGAAGCAGC ACAAGGATGA 
99501 GTGTAATAAA TAACACATAC TTAGAGGCAA GGAAAGAATT CTGCAAAGCT 
99551 GTGCTTOTTC AAAGOCATTC GCATTATOCT TAAAGCTGGG ATTTCAACAC 
99601 AGCTTTCAGA CAAATATCTC TGAAATATAC ICM 1 1 IATG AAGGAGTCTC 
99651 CAI1CJL1 ICA TTGCTAATOC AGAGATAGGA GTC C TGCTAT TTTCAGOCAT 
99701 ACTGGGGCTA CACCAAAGAT TOCTTTGCAC GTTTOCCTTC IGI ICICILA 
99751 GAAGGAAGAA GAGAGGGCAT GT7GAGCTOT TGCAGQGCTC AGAGCATGCT 
99801 TCAGAGGCAG GGAGAAAACT CTGGAGGAAA GCAGCTTTTC TTTTGATATA 
99851 ATTAATGGGA ATGAGAAAAT ATCTATAGOC TTATTTTCAG GCGCAACrTC 
99901 TCTTTTGATC TCAAGTACAT TCTGAATATG AGAAAACTGA GGOCATCCAG 
99951 TTACI 1 1 ICA GAAOCTGTGA CAAGCAGAAC ATGGAGCATA CATAGCTTTG 
100001 ICI ICAATTT TOLI I ILIAC ACTAAACATT AAGCATAACA GAAGAACAAA 
100051 AATOGACATG TAGAAATTTA TAGCAAGATC TATCCTTTAT TTGATTAAGA 
100101 TAAATACTAT TOCAGGAAAA TOGAAAAAGG TAAACIGCI I GAAATTTAGT 
100151 CAGATATAAA GGCPOGGAGG GCACTGGTGG ATCATTAGTC TCCTGAGAGA 
100201 GCTCTAAAGA ATTAGTCTCT TGGAAAACTG TTCCCTCCTG TTAATCTGTA 
100251 AATTTCAGCA GTGGCTTTTT 1 1 1 1 II 1 1 IA AGACAGGGTC TCACTCTCTT 
100301 GTXjCAGGCTG GTA7GCAGGG GTGCAATCAC AGCTCACTGT AGCmOGAGC 
100351 TCGGGGACTC AAGCAATCCT GGGAGCTCAG CCTOQCAAGT AGCTAGGAGC 
100401 ACAGGTGCAC ACCAOCACAC ATOGCTAATT TTTAAI I 1 1 I TOTAGAGATG 
100451 GGGTTCTCAC CACATTGGCC AGGOGGTCT CAAACTTCTG TCCTCAAACA 
100501 ATOCTCCIGC CTTCGCTGGC TCGCAAAGCA CTGGGATTAC AGG7GCAAGC 
100551 CACTACATCT GGGCTTCACT AGO 1 1 1 1 ICA I M IGM MG CATGTGTCTC 
100601 AGO I 1 1 1 ATT TCATAAAATC CAGTATACTT TCAATCATCT CAAATTTCAT 
100651 TTCTAATATG GACATTCGCA TCTCTCAAAT GCTTOGACTA ATAATCAAAT 
100701 TAAAG1 I ICI TCAAGTT7GA GGAAGCTAAA TTAGGCAATT AGATAAGGGT 
100751 OCT I ICATCT TTTTATATCA ACTAGAAAAT AAAI ICI 1 1 I GATATGGGAT 
100801 GAATAGAAAT AGAAATCTTA ATTTGAAGAA ICI ICJOGCTT GFGAGGCTAT 
100851 ACTAAATGGC TTTTOOCTGA TATTTAGAAG GTGGCTTTGG Gl PGTGGAGA 
100901 GAG I IGICIG ATOCATTCAG AGTACATTTC TTAOCTTCAA CATCTAGQGC 
100951 ATGCI I IGGG AGAAGOOCTT GVACTCACTA ACTCTAAGGA TCATAGAGCA 
101001 TAAGGCTAAG GAGGGCI ICI TATGTATTCA TCCTATCAGG AAGG G I C I 1 1 
101051 AGCAOCCAAA CAAAGI ICIA GGGGCTGTAC ATTGCTGA7G TCTTAAGCCT 
101101 CAGCTGOOCA TCTAGCATCT ATTTAGGGCT ATGCI I I OCX CACI I ICI AT 
101151 GOCTATCATT ATATCTCTGG CICI 1 1 IOGC GCTCTOGCT TOGGCAGCTT 
101201 ACTTCTAATT AGAAAGTTTA TATTOOCTCA TAACATATTG TAAAAGTGCT 
101251 CATTTAAAGG GCAATGCACA GCAAATTGGA GGTGTATAAT TGCAAAOVTG 
101301 GAATGOCTAT AICICIGI IA TGCAATOGCT GTATCTCTGT ATGCATGTTA 
101351 AATTGAACTG ATGCI Mill GAAGTAAAAT GGTAAGAAGA GTGGCAACAT 
101401 CTAGICI ICA GAGCATAGTT TAAGAI II 1 1 GCGCAATOCT GCAAGCCATC 
101451 CAATGGTGTG CTT7GAAAAC CACAGGTTTC TTTTAGAGAA ATACAACATT 
101501 TATTTOCCGC AM ICI II II GATTTAACAT TTTAGTTAAC AM II IATTA 
101551 ACATTTTAGT CTAGAAGATG OTOCAGATT ATCTCTCATT GGAGTCTCAG 
101601 GAGCACTGTG TGAAATGGGC AATATCAGGG CI ICIATCTA GGAAGAAAAG 
101651 AAGCAGATTT GGGGTGGGGA AACAACTTGC TCAGGGTTGC AAGGATGGTA 
101701 CATGGTGCAG GCAGGGCT7G AGCTTGGGTG TTCTTAAAAG TCTGGCI I 1 1 
101751 AAATAAAATA CTTAAGTGOC TGGCAAAAAA GTATAACATT AACTTAGGAC 
101801 CTGAAAGGCA TTGTAGAGAT CAGGTAGTTC CACTGCTGOC CCTGGGCTAC 
101851 AAAAAAAGAA AGGTAAAGGA AGGAAGGCAT GGAATAGTTA AGTTGCTTOC 
101901 CAAAAGCCAC AGTTATAAAA GTAGCAGAAC TGGGTGTAAT AGGCAAGAAC 
101951 ATOCATOGAA AATAAATGGA AGCTTATTAC AGCGCAGOCT GTAAATATCT 
102001 AOVTAGAAAC AGAATCTCTA TCTAGAAACA AAATTATTAG AGGAGTCAAA 
102051 TTAGTTTCTC TOCTAACCCT GGTCAACTCA TAGQOTTCAT TTCGCAATCA 
102101 GGAGGTTCCT TTOOCATTTT GACATCATCC Al 1 1 I CI LAG AGTTGCCAAG 
102151 AGCAATTCCA ACTGGCTTTC MGILI I ICI TTCATOGATC TATTTCGAGC 
102201 TCAGQGACAA GGTTCTTATC C3QQCAGA7TG TCGCTGAGAA 7GCAAGGCTC 
102251 ACAGTAGTAT COTATAAAC CAAAGTTCAC 7GTAGACTGG GCCAGGAATC 
102301 GGCTQCTCAA GGTGAGCTCT CCTTCATCTT GGTTCGTTTA ICI I PG1 RQG 
102351 rTOGTTCTOT TTATCTCTGT TCACTATAAG GTTCTGAGAG AAGCAAAGTC 
102401 T I GGTCCGOG TTACATCTXX GCAGGCTOGT QGATCATQOT TACAGAGTOC 
102451 TGGAAACTTT TTTATTTATT TAAAAT9GAG TTACGTAACT GAAGAGTPGC 
102501 CCTATCTCAG TCAGCACTOC AAOCAATTAA AATAGAAAGG OTAAAATAT 
102551 TAATTTTCTT TTTAGOCTAG AGTCTTAAAT ACTAGQOTTT AAAAOTTTCA 

102601 TTTAcrrncr ctctcoctct ctcccpctct C30CTCTCPCC CTGGCTTCCr 

102651 TCTTTCTTCC CTXXCTOGCT ICI I I CJLI I I I I CC I 1 1 I IC TTAACTATCC 
102701 AGAATCCTTT GCTAOTTCC AGAATGCTTT GCTACTTTCC AAAATAAAGA 
102751 GTAATCACTT TOG 1 1 1 ICI TCAAGACTTT CAAATTTAAG TAATTTTCTT 
102801 I I I C I A TTTT TCATTCAAAA ATAGGCTTAT AACATTTCCT TGACACAAGG 
102851 ATATATATAT ATATTTAGAG ACAGGGTCTT GCTATCTTGC TTAGGCTGGC 
102901 CTTCAACTOC TOGTCTCAAG TCATCCTTXT GTCTCAGGCT TCCCAAGTGC 
102951 TCGGATCACA GGCATGAGCC ACTCTACTCA GOGTTATTA ATCTAAATAT 
103001 GATAATTTAC GCACTGAGAT TC A I I PGTG C TGATTAAATT TACTCTCAAT 
103051 COCTATATCT ATTTCTTCTA 1 1 1 I IU Ibl TCTTATCTCG CCTCGAAATC 
103101 1 1 I CCI 1 1 A T TCATTOVTTC ATTCCCmOG CAATTAACTT OCAAAAAGGC 
103151 TATCAAGATA TTTATOCACA TATACATTTT ATCATTCAAC TCTCATGGTA 
103201 AOL 1 1 1 1 ATA AGGAAAGCAG TCCTAATAGT GGTICI ICIA CTAATAGATA 
103251 AGAACATTCA GGCTCAGAGA GGTTTAAAAG GTTTCCCTAA GGTTGGAGAG 
103301 TTAAGTAACA GAGTTOOCAT TAGAATAGAT TTCCAGTTTA TTCTAATGGC 
103351 AAOCTTAATT ATTCTTCTTT GAL I IGCICI AATTATAAAT CAGTTAAACA 
103401 GATTCAGGGA TTCATTTGAG TATCTATTCT GTGGCATAGA CATTOAGAT 
103451 GTOGGGATAC AATCCTAAGA ACAGATACTT TCTCATAGAC AGCTCAATAA 
103501 ACAAOTAAAT TCATAAAGAA ACAAAGTAGT TTTCAAATAG TCATATTTTC 
103551 TTCAAAGGAA ACAAA1TGAG ATCGAAGGTG ATOGTAAGAG ATOGTAAGAG 
103601 ACAACTTTOG CGAGTCAGGG ACAACCTCAT TCAAAAGCTC ATAGTAAGTA 
103651 CATCOTTOG GTAAAGGGTA GTATAAGGTA CTTTGAAGGT ACAAAAATAA 
103701 GACAGCTTTC TATTOOCCTT GQGAGGOCTA TAACAGAATT TCTCAAGTPCT 
103751 CTAAGGCCAA TCAAGAGTTG GAM 1 1 1 1 !A TGCAACTTAT TTTTAATTCA 
103801 TCTATTATTA AAAATCTGCA TA TCAAA AAT GAAAATGTCT TGCATACT7T 
103851 GCTUTAGGAC OCAATCATTC I I 1 1 1 ICI IC ATATACRQCA TTAAICICI I 
103901 TTCACACTOC TAATAAAGAC TTAOCTGAGA GGAGGTAATT TAGGAAGAAA 
103951 AAGAGGTTTA ATOGACTTAA AGTTCCAGAT GGCTGGGTAG GCmCACAGT 
104001 CATGGTGGAA GATCGAGGAG GATCAAAGGC A I G 1 L I I A GA TGGTGGCAGG 
104051 CAGGGGAGTA TCTOCAGGGG AACTGCGCTT TATAAAAOCA TCAGATCACA 
104101 TCAGACTTAT TCACTCTCAC GAGAATAGGA CAAGAAAAAC CTCTCGGCAT 
104151 GATTTAATTA CCRXCAOCA QCTTOCTCCC ATGATATGTG GGGATTATCG 
104201 GAGCTAGAAT TCAAGATCAA ATTTOGGTAG GGAAGAGAGC CAAACCATAT 
104251 CAI ICIGGGC CTOGCTCjCTC OCAAATTTCA TCGGCTCACA TTTCAAAACT 
104301 AATTATCOCT TCCTAATAGT (XCTCAAAGT CTTAACTCAT TTGAGCATTA 
104351 ACTCAAAAGT CCAGAGTOCA AAGTCTGATC TCAGACAAGG CAAGTCGCCT 
104401 OPGGCTATCA AICIGTGAAA TCGAAAGCAA GTTAGTTACT TCCTAGATAC 
104451 AATAGGGGTA CATCCATTCG GTAAATACAC OCATTCCAAA TCGGAGACAT 
104501 TGGOCAAAAT AAAGGGGCTA CAGGCCOCAT GCAAGTCTCA AATGCAATAG 
104551 GGCAGTCATT AAAOCTTAAA GTTCCAAAAC GAICI ICI 1 1 GATTOCATCT 
104601 CTOVCATTCA GGGCACATTC ATCTAAGAGG TCTTCTOOCA TOGCCTTOGG 
104651 AAGCTCTCCC CCTCTGGCTT TCCAGGGTAC AACGGGGCTT CTOGCTGCTT 
104701 TCATOGGCTG GCATTCAGTC TCTGCAGGTT TTOCAGATCC ACAGTCTAGG 
104751 CPGTCAGTGG ATCTAGCATT CTGGGGTCTC GAGGAOGGTA GCC LIL I ICI 
104801 CATAGCTOCA CTAGGCAOTG OOCCACTOGG GACTOUTCT AGGGGCPOCA 
104851 GTCOCACATT TCOCTTOCAC ACTGCCTTAG CAGAGGTTCT TCATCAGGTT 
104901 OCTGOCGCTG CAGCAAACTT CTGGCTQGAC ATOCAGGCAT TTOCATACAT 
104951 CCTO G AAAT CTAQGP3GAG Gl PQGGAAOC TCAATTCITC Al I ILIGIOC 
105001 ACGOGCAGAC TCAACAOCAT GTGGAAGCTC CCAAGGCTTC GGGCTTGCAC 
105051 OCTCTGAAGC CATCGOCTCA GCTOTAOCTT AGCCOCTTTT AGGCATQGCT 
105101 GGTCCAGCTA GGATAGAGTA CACTAAGTCC CTAAGCTGCA CACAGCAGGG 
105151 GGGCCACACA GCAGGGGGAC TCTGGGGCTC GGGCATGAAA TCAI I I 1 1 IC 
105201 CI PCTOGGGC TOGAGQOCTA TCATGGGAAG GGCTAOGTTC AAGGTCTCTG 
105251 ACATGCCTTC GGGACJGI 1 1 1 CACCATTCTC TTOGCAATTA ACATTGAGCT 
105301 I CI RGTCACT TATCCAAATT CTOCAAA TCT CTCCAGCTOG CTTGAACTTC 
105351 TCCCCAGAAA ATOGG1 1 1 1 1 CM 1 1 ICI IC TTOCTTTTTl ATTATTATAC 
105401 TTTAAGTTAT GGAATACATC TOCAGAAGGT GGAG G1 I IG I TACATAGGCA 
105451 TACATATOOC ATOGTAGTTT GCTCTATOCA TCAACCTCTC ATCTACATTA 
105501 GGTATTTCTC CTAATCCTAT OCCTQCTOTA GGGCGOGACT CCGCTGACCA 
105551 GGGCTOGAAT GTGA7CTTCC GCTOGCTGTC TCCATCTOTT CTGATTCTTC 
105601 AATTOOCACT TATCAGTCAG AGCATCTOOT GTTTGGTTn CTAATCCTCT 
105651 TTTAGTTTOC TCAGAATCAT GCTTTOCAGC TTCATOCATC TCGCTGCAAA 
105701 GGACATCAAC TCATCCTTTT TTATOGCTQC ATAGTATTCC ATCG TCTATA 
105751 TCTOOCACTT TTTCTTTATC CAGTCTATCA TTGATCGGCA TTTOQOTTOG 
105801 TTCCAAGTCT TPGCTATTCT GAATAGTCCT GCAATAAACA TATCTGTGCA 
105851 IGIGICTI IA TAATAGAATC ATTTATAATC CTTTQQOTAT ATAACCAGTA 
105901 ATCGGATTCC TCGGTCAAAT GGT ATTTCTC GTTCAAGTTC CTTCAGGAAT 
105951 TCCCACACTA PCTTCCACAA TCGTTCAACT AATTTACACT OOCACCAACA 
106001 GTCTAAAAGC Gl 1 1 1 IATTT CTOCACATCC TCTCCAGCAT CIGI IGI I IC 
106051 CTCACATTTT AATCATTCCC ATTCTAACTG GOGTGAGATA GTATCTCATT 
106101 GVGGTTTVGA TTTCCATTTC TCTAATGAGC AGTGATCATC M M 1 1 II IA 
106151 TATATATTTC TTOGCTCCAA AAATCTCTTC I I I IGAAAAG TGTTCATATC 
106201 CTTCACCVAC II 1 1 I CA TCG G G I I GI I I G I I I 1 1 1 I CI I G TAAAI I IGI I 
106251 AAAGTTCCTT GTAGATTCTC GATATTAAGC CTTTCTOVGA TCGATAGATT 
106301 GCAAAAATTT ICICLCATTC TCTAGGTTCC CIGI ICACTC TCATCGTAGT 
106351 TC C I 1 1 1 G CT GTGCAGAAGC GCTTTAGTTT AATTAGATCC AAI I IGICAA 
106401 ITTTOGCm IGI IGOCATT G CM I I Gb I G TTCTAGTCAT GAAGTCTTTG 
106451 COCATCCCTA IGICCIGAAT GGTATTOCCT AGGI 1 1 ICI I CTAAGCTTTT 
106501 TATO G I I C I G TTGCATOGTC AGGCTCCAAA I 1 1 ICJLAAAC TTTTATOCTC 

106551 TCcmocrcr tcaacacttt gcigciiaga aaiiiiiicc agoggatacc 

106601 CCAAATCATC CPCTCAAGTT CAAAGTTOCA CAGATCTOA GGGCAGGGAC 
106651 AAAATCOCAC CAGICICI 1 1 CTATAQCAAG TCTCACOTT ACTCCAGTTC 
106701 OCAACAAATT CCTCATCTCC ATCTGAGACC ACCTtAGAAG CCATTCAACA 
106751 GGTCTCT A GG AAGTPQCAAA CTTTOCTAjCA ICI ICCI I IC TTCTCAGGOC 
106801 TCCAAGTCTC TAGGAAGTTT CAAA C I I I CC CACATTTTOC TATCI ICI IC 
106851 TCAGOOCTCC AGA CIG I ICC AAOCTCTCOC TCTTAGGCAG TTOCAAAGTT 
106901 GCmQCACAT 1 1 1 IGQGTAC CTTTACAGCA QCAO00CACT ACCCAGTAOC 
106951 AATTTACTUT ATTAGTCTOT TCTCATGCTG CTAATAAAGA CATAGCCGAA 
107001 ACTQGGCAAT TTATAAAGAA AAGAGCTTTA ATCGACTCAC AGTTCCATAT 
107051 GQCTQQGGAG GGCTCACAAT CATCGTGGAA GGTGGAGGAG GAGCAAAAOC 
107101 AIGICI IAGA OGGCAGGAGG CAAGAGTCTG CATGCAGAGG AACTCCCCTT 
107151 TATCAAACCA TCAGATCTCA TCAGACTTAT TCACTATCAT GAGAACAGCA 
107201 CAGGAAAAAC GGACGOGGAT GATTCAATTA AGTCGCAOCA GGTCCCTCCT 
107251 AGGACACATC GGGATTATCG GAGTTACAAT TCAAGATCAG ATTTCGATOG 
107301 GAACACAGOC AAGGCATATT ATATACTTTA TTAACTACTT AICICI ICIA 
107351 ATAATACACT IGI ICI CATC TOGTTATTCT TCTTATTAGC 1 1 1 1 1 1 1 IGA 
107401 GATCAAGTCT CACT A I G I I G C3CTAGGCTAG TCTCAAACTC CTOQQCATCA 
107451 GGCATCCTCC CACATOGOCT GGCAAAGTGC TOGGATTACA GATCTGAGGC 
107501 ATCACTCCIG GTCTCATCTC GTTATTACAG TGAATGATTA AGACAGCTTC 
107551 ICICCI ICAC 1 1 IGICCCTA CTTTTTCTCA GATTTTTTTC AATCATCICI 
107601 ATC CI 1 1 ICI IGI 1 1 1 1 I IC TAGCI I I I C I AAOCTTCCAT TTAI 1 1 1 OCT 
107651 TCAGATCTCA ACAICIGCIG CAATTTGACT GTTCAGGTTA GTCACAATTT 
107701 GOQCATTTAT CAGI 1 1 IGIG OCTTAGGCAG TATTCAGCAG CATTCTCCTA 
107751 CTTCAGATCA ATAAGGATCT TTATTTATCT GAGCACTTCT TTACTCATTC 
107801 ATOGGGACAT TTAATATTTA CAGAACACTT TCATCAAAAC AAGC C I G I 1 1 
107851 III CM II CA AAATATAATA TACTAQCATA GGAACTTGAC AGAAGAGGTA 
107901 ATAATACAGA AGAAATCTAG AGAACTGATC ATCGAGAAAT AATTAAACTA 
107951 AAACAAAGCT GCIGCI I ATA GTAAGGTAGA GCAAGI I IGI CCIGIGI ICC 
108001 AAATTATACT TA9CCAAAAA TAAATATTTA TAGATAATTC AATAGTAGTT 
108051 TTTAGAAATC ATTCATCGAT TACTCAGQGG TGGAAATTAT OGCTCTAATC 
108101 TAGGOQGCAA ACTTCTAAAA 'TATTTATAAT TTCTCAGGGA GAAATAAATC 
108151 CACAAACATT TCAAAATATC TAATTTTAAC TTAAATTCAA AAACAGCAOT 
108201 AGTCTATTTT TCTTAGGOOC TTATATTOGG TAATATAGTC TATCTAAGTA 
108251 TOGGAAAGGC TCGTGAAAAC TCFGGATTTA TCTACAAAAT AGATTCGTAC 
108301 ATTOG9GAGT I I IGIGI G QG AAGTCTOCTA ACACCTOVGG AAATOCTAAG 
108351 GAAGAAATCG GTAQGGTAGA CTGAACACTC CACAAGGGTA GGGAGAGQGT 
108401 CI I CICI I CA TTOCTCTAGC CCCAGCACCT AGAATCTOCC TOGCACTCAA 
108451 IC I 1 1 IC I 1 1 O CIII I IIA A CTCATTTCAG GTTCAGTTTA TAGGCTAGCT 
108501 GAGTTTATAG GCTGGTAGAG TACAAGGTCT ACTA A I I I CI CTACTCATCA 
108551 CTACTTTACr TTCTAATAGC ACTTTATCTT CTTCAAGTOC 1 1 1 1 1 1 ICI I 
108601 IGI ICI IC AA TCTATAATAA GGTCAGAACA GTTTTATCTC CATCTTCCAG 
108651 GGAGAAGAGT TAGTTAAGAG AC 1 1 ICICAG GTCAGACAGA TATACTTAAT 
108701 GACAAGGTCA GGTTTAAACC TTAAATAATA GAAATAAAAG TCATTTTATA 
108751 ATTATCTAGA GTAGTTTCAA TGTCAAATAA CTTAAAGGTA TCGAAATX3GA 
108801 TCGCAAGAAG TATAGTCAGT C1TQCTQGAG TAAAATAATG CGCAGTGCTT 

108851 TCTGOcrrcr cccagcpqct gcttocagaa gaaoggggtg tctcagtctg 

108901 AACATCACOC AACAAGTAGG TTAACAGATA TCCA0GG00C TCTTGACCCA 
108951 CATACATATC AGTGGGATTT AGAA7GCTGC CACATATTCA TCATPGAATT 
109001 TATGAAGCAT ATAATATOCT CAATAATAAA CCAAGTCTCC CTCTCGCAAC 
109051 TTCTTATCTC TQCmCTCTC AACAGATCTT MC I U I A TA TOLIULM AC 
109101 TOCTCAGGTG CTCTOCAGG GACT 1 1 ICA G TTCCIGAGCT 1 01 (XI 1 1 IC 
109151 AGCATTTTCT CAGAGGACAA TTCTTAGCTT OCIGI IGATT GCPCAAGCAT 
109201 TAAI IGU I I TTTCTGCCAG AT A 1 1 HOG 1 GCTAGGCTCT TGAGGCCTCA 
109251 GAGCIGI ICI GAATTATGCA GTGGGAATTG GCCAGGATTA GGAATCAGCT 
109301 AATCTOX3CA (XAOjGGCGC TTCCTTGTGA GGCA C I FOC AACTCTGCAT 
109351 O0CTTATACC TTCACAGCAA OGCTCTOT A C CCAAAGCACT GTCATACTOG 
109401 GTOO C POG I TPCTCTCCTG AAATAAAATT GCTAGATAAG AAGAGCTCTA 
109451 TATTOCAGGC CI IGICI I IG ATTTTAGQGA AAAAAAAGAA AACTAGCTAT 
109501 ATACATAA7G TTTTTAAAAA TCAGTAA7GT OCCACKJbll AGAGAAAGGA 
109551 GAAATAAAAG AAGTAACTTA ATGOCTOGGA TACGTGCTAC AACATGGATC 
109601 AAGCA7GAGG ACATTACACC AAG7GAAATA CTCCAGGCAC AAAAGCAGGA 
109651 ATACTOTATG GTTCCGCTTA GATCAGGTAC GCAGAGAAGT CACATTCATA 
109701 AATACTGAAA GTTCTATGGT GGTTPCCAAG GGGAGGGGGA AATCAGGAGT 
109751 TATTTAATCG GCACAGAGTT TCAGCTTCAG AGGAGGTOGT GAOVGTTCTA 
109801 CAACAATGTA AATCTAOTA ATATAGTACA CTTAAAATGG TTAAAATGGC 
109851 AAATTTTATG AAATAGGAAT TTATCACGAT AAAAAATTAA AAAGTAAGAA 
109901 AAGTTACTGC TTOGGGGAAA CTATATOVAA AAAATAAAAA TAGTTCG0CAC 
109951 AAATTTOCAA AACAAO0CTA ATCAGGTCTT GCTGGCTAAA TCGTGAAGCA 
110001 AATTCTGAAC CAATGTGTAG TCI I RG AGAC TGGGAAACTG ATGGQCAAGA 
110051 TTTTAGGCTC AATAAGGAGT AGAGTTCATA ATTTCACnDC AAAGACATTT 
110101 CTTTOOCTAC CATGCCAAGG CCATCTGATT GGGAGTCCAA AGAAGI 1 1 IC 
110151 ICICICCICI GTAGGCTGOC TTAATOCAGA GTACACAAGC CTPCCATTTT 
110201 CTTATCTGTC CTCCTAGCAG GGTCTOGP G C I I I ICCICCT GAACACTGAC 
110251 TGTATAATTA GCAGAGAAAA CTAAAGATAT TTAAAATATA GGCAGICCIC 
110301 TACATOCAAG GTTCCACATC CTTOGATTCA ACCAAOCATC GATTCVNAAT 

110351 ATTTOGGGGA aaaaaaaaca ataaaaaaac actoggctcg gcagcatagt 
110401 gagatggcat crtacaaaa acattaaaat attagctgag catpocagoa 

110451 CTTTOGGAGG CTGAGGCAGG CAGATCAGCT GAGGTCAGGA GTTGGAGAOC 
110501 AGACTGGGCA ACATOGCGAA AOOCTCTCTG TACTAAAAAT ACAAAAATTA 
110551 GOCAGGCATC ATGGCAGCTG CCTATAGTCC CAGCTACTCA GGAGGCTGAG 
110601 GCAGGGAAAA TTTCTTGAAC OOGTGAAGCA GAGCTTOCAG TTAGOCAAGA 
110651 TCGCAGCACT GCACTGCAGC CTGGGTGACA GAGTCAAACT CTCTCTCAAA 
110701 AAAATAAAAA TAAAAATAAA TAAAAATTAG CTGAGCATAG TCGCATCTGC 
110751 GCATOGTOGC AGCTACTTAG GGGGTTGAGG TCGCACTGAG CTGTCATOGT 
110801 GOCACTGCAC TCCAGOCTAG GCAACAGGGA GAGCOCATCT CAAAACAAAA 
110851 ACAATAAAAC AGAACAOVGA TTAAAAACAA AATAGAGGCT GGGCTCACTG 
110901 GCTTATGCCT CTAATOOCAG AACTTTGAGA GGOCAAGGTG GGAGGATTCC 
110951 TTCTGCTCAG GAG I 1 1 JAGA 7CAGCCTGGG TAACAOGGCA AGAOCACATC 
111001 TCTAGAAACA ACAACAACAG GAGACTATAC TTPOGGGAC CAI I ICIGGG 
111051 GATCATAGTT TCTTACTAGA GAA GM I CIC TCTGTAGAGC ATTGAAATAT 
111101 AAAAATCCAG AATAATCATT TACATAQCAT TTACATTGTA I IGbl IATTA 
111151 TAAGTAATCT AGAGATTAAT TAAAGTATAC AGGAGGATAT ACATAGCTTA 
111201 CATCCAAATA CTACAGCATT TTATATAGGG GAOTCATCA TOCATAGATA 
111251 GGGGTATCTG AGGAGGTCTT GGGTTCAGTT CTCCAOGGAT AGCAAGAGAC 
111301 TAATCTTAAT TTCAI IIUJC CAAOCTOCAC ACCAGAACTC TCAAATAAGA 
111351 ATAAGAAAAG GAGCAGTTGG GATAGACAAT ATCAGAAGTA TCTGGAAATC 
111401 ATAACAGTCG AAGGAAAGCT GATCTAQGCC TACTCAAGAA ATTTTAATCT 
111451 TCATTCTGGT AAAAAGAAAT TAGATTTATC GGTOGAAATT TGAGCCAGCA 
111501 ATTAGATGGC TCTTAGGATT AATAAAAAAA GACTGAACAT CATGOCmQC 
111551 AAAGACTGAG GGAAAGAGAT AGATAGGAGA CTTTGGCAAA GTAGCACTTT 
111601 AGC3CAACATC ATTAGCXTAA ATCTTACTGA AGAGAGGTTA GAAGAAAGGT 
111651 AGAATTTTCA TOGAAGGATC C A I 1 1 1 IC I I CACTTCAGAA TTAAGGGAAA 
111701 AATTAQGAAG CTCAATAAGA ACTAATGGGC TA A I I I C I I I CI I ICI I ICA 
111751 AAAATCAAAT CTTTAAGTTA AAATTTCAAT AAOCAACAGA AGAAGGTAGA 
111801 AOCAAI I I IG TCAATTTACA AAATACTTCC TTATTGACAC TATTCGCAAG 
111851 GTCATTAAAT AAGTTATCAA GTTAOCTCAG ATOC3CAGTGA TTTAAATAAT 
111901 GCI I ICIGAA TCTATOCTTT TCIGI 1 1 IAA GAAGAAGCTG TATTAGGTTC 
111951 I PGTC1 ICCT ATAAAGAAAT AOCTGAGGOC GGATCATTTA TAAAGAAAAG 
112001 AGGTTTAATT GGCTOVQGAT TCTOCAGGCT GTATAGGAAG CATGGCCOCA 
112051 GC AICroCIC AGTTTTCPGC TCAGGTCTOG GGGAGCTTTT ACTCA7GGCG 
112101 AAAGCAGAGT GGAAGCAGCA GGTCACTTGA TCAAATTCAG AGCAAGAGTA 
112151 TOGGTOGGGA GCTGCCATAC TCTTAAOOCA ATCKTAGTG AACAGAAGCA 
112201 ATAACPCACT TATCAOCAAG GGAATGGTGC TAAGOCACTT GTGATGGATC 
112251 CAOCTCCAAA ATCCAGTCAC CTCGCAGCAG GTCCCAOCTC CAACATTCQG 
112301 AATOVCATTT CAtfXTGAGA TATCGAAGGG ACAAACATTC AAACCATATC 
112351 AGAAGGCTAT CTTAGGCTGG GCAGGGTGGC TCAGGGCTCT AATQGCAGCA 
112401 CTTTCAGAGG CGQVGGCAGG CAGATCATTT GAGGTCAGGA GTTTGAGACC 
112451 AGCAQGGGCA ACATGGTGAA AGCCCATCTC TACTAAAAAT ACAAAAACTA 
112501 GCTGGGCATG GTGGCACACA GCTCTAATCT CAGCTACTOG GGAGGCTGAG 
112551 GCAGGAAGCT CTO PG AAOC GGTOGGCAGA GGTTGCAGTG AACTGAGATT 
112601 CPGGCACTOC ACTOCAGTCr GGGCAAOCGA GTGAGGCTCT GTCTAAAAAA 
112651 AAGAGAAGOC TATATTAAAC TTATAAAATT TAATATCATT TCAACTAG0C 
112701 I 1 1 I <j I IG GG TGCATTTGTT CACTTTGGAC TAI 1 1 1 1 OCA AATTCATCTA 
112751 GA GICG P G GA 1CICI IAACA ATCAGGATAT Gl ICIGAGAA ATGCATOCTT 
112801 AGGCAATCTC ATTOTTCTGC AAACATCATA GAGTGTACTT AGACAACGT 
112851 AGATGGTCTA GTCACTACAT ACCTAGGCTA TATCGCATAG GTAGAG0CTA 
112901 I IGCfXXTAG GCTACAAAGC TCTACAGCAT GTTACPGCAC TGAATCCTGT 
112951 AGGCAGTTGT AACACATOGT Al I IGIGIAT CTAAACATGG AAAAGGTACA 
113001 GTGAAAGTAC AGGATTATAA TCTTATOGAA TOGCTCTTAT ATATCTGGCT 
113051 CATCTTTGAC CAGAAATCTT ATTATGTAGC ACATCACTCT AATTCCACTA 
113101 TTCATTAGAG ACTC3CATAAC GCACAOCTAC CIGI ICA7TT GCATGCACAT 
113151 TTAGO0GATA GGTGACTTTA TCACTAGATC AGTCCAAGAA TAAGTTTAAA 
113201 QVGCATCTCC (XI 1 1 ICI IG Mill IAATA AATGTAAAAC ATCGAAGTAC 
113251 ATGATACCTC AAAGGAGCAT GAATCAGAAA CAGGAGTGCA CATTTACAGA 
113301 GGCTTTOGAA OCACTTTTAT ATCTTCAAAG 7GCTCTAGCA AAGTAT0GGA 
113351 GTCAGTCAGT CATOCAGGTA GGCTGGCAAC CATOGCTG4G CCCTCAGGAG 
113401 CTOTCTTCAT GAOCAGCATA GACACGGTCT Gl IGIGI I LA TTCAAACTTG 
113451 GCAGGAACCA AGGATCTGAG AATGGTAAGT C1GGIGTGCC AAGAATGAAA 
113501 CTOCAAGAAA GAAACTTOVG AATCAAATTT AAATTTAGGC TGGGTGTGGG 
113551 GGCTOVTGCC AGTAGTOCCA GOGCTTTGGG AGGCTGAGGT GGGCAGATCA 
113601 GCTGAGGTCA GGAGTTTCAG AOCAG0CTGG GCAACATGGT AAAACTXTTOT 
113651 CTCTGCTAAA AACACAAAAT TAGCTGGGCA PGGTGGTGGA CAGCTGTAAT 
113701 GCCAGCTACT TCGGAGGCTG AOGCAGGAGA ATTCCTTGAA TOOGGGAGGT 
113751 GGAGGTGGCA GTGAGGOGAG ATCACAGCAT TGCATTOCAG GCTGGGTCAC 
113801 AAGAOCAAAA TGCXATCTCA AAA A AAAAAA AAAATOGAAT GTAGGTGGAA 
113851 TTAATAAAAA TTTAAATAAA CTAACAGATT AAACCTTTCA I IGI ICAGGA 
113901 AGGAAGAGAA CAATTTTTGA TAACI IGIGA AATATCTGGC ACAGAAATTA 
113951 TTTAGAGCCA CTAAATAATT TCAAATTAGC TAAAAATOCT AGTGATTTAT 
114001 1 1 PCTATTTT AAGATCAAGT CTACTTTAAA CTTCTAAAAT GCAGGGTTAT 
114051 TTAAACTGGC ATCTAAATOC AAGCTGG1 1 I TGGTTGGTAA TTCCTCTAGG 
114101 ACATTTTACT AAA I CI I CAT OTATCTAAA TGATCCTATC TCATAGATGG 
114151 ACIGI I I'GIT I IGI I IGIAA TOCAGGGAAA TTAAAAAAAA AAAAAAACAA 
114201 GTAGAAATAA AGGLI 1 1 IAA AGAATTTTTA GAGTTAGAAA IGI I I ICAAA 
114251 ATTAG G1 IC I TTAAAGCATT AGQC A TCTCT TCCI I L I G AA CICI ICI 1 1 1 
114301 TTCTGOCCTT TOGTAGCTAT GAAATAATCT GCATTCCAGA AAC1 ICI I 1 1 
114351 TTCCCAGTCC 1 1 1 1 I CA TCT OTAACAGTC CCA7GCATCA TTATCTACAC 
114401 GATOGAAACC CATCTTAATG AAATOGAAAG AICIGUGI I TAAAAAAACA 
114451 AACAACCAAT CAO0CATG0C TCCTAGAGTC CIGI IAGGCC AAAGGTOGC 
114501 TCTOCACOG GTCAGCTGCT GAGGAGGGGG CAGGTAOCTG TAOCAGAGTT 
114551 TAGCCACAAC AGAAATATOG TTGGGCTAOC AAAAGA7GTC AG ICC 1 1 AAA 
114601 A AIGI IG GCA AGCAAAGAAA I 1 1 1 ICICIA AGCACATCTT AGAGATACTT 
114651 TAGTCTAAAG GCACAGTTGA TCAGATTAAA GAAGCTAAGA TTAGAI 1 1 IG 
114701 GGGCTGGTCT AATOGATAAA TATCTATCAG TGGGCATAAA Gl 1 1 IATTTC 
114751 AGIGI 1 1 ICI GGTOTAAAGG GATTAATTTT ATGTACTCAA GTCTATCGAG 
114801 M IG I II I CI TOCAOGTTA ATCACCTCAC TOGGCI PGCT GGTCTGCAGT 
114851 GCTCATGCAA AACTTOCTAA GGGAGTGGGC CACI IGI 1 11 AATTTACTTC 
114901 CTCTAGCAAA AACACAGOA GAGGAAAOVC AAA I CGI 1 IA AAAAAGAAAG 
114951 AAAAAAACAA AATAAACTOT CTCATTTCAT TCAGAAGATT IC1GI I CATC 
115001 CTACTTTTAA GTCAAAGGGT AGTCTTAAGT TTTGATTTGC AACAAACTCA 
115051 GATTATOCAT TAAOGGAOGC TTGCAGGCTT TCTAGPOOCr TTCAGATCTA 
115101 OGTCTATTTT TTTAAAGGGC GCAGGCTTTT TCAGTTTAOC TTTCAAAAAA 
115151 GTAIGI 1 1 AG GGAGAATGGT CTTCTAGAGT TATOGAATAG CI 1 1 ICAAAT 
115201 GTTOGA7GGA TAI I I ICI IA GAATTACTOC AAAATGCAGT AAATGAGAAG 
115251 CAACAAGCTG AAGCTGOCAG TCTCTGGTAT GAATAGCAGT GAGTAAAAOT 
115301 GC1 I IGAACT TAGTGTCCCT ATTGGAAAGC AGGAAGTTGA AAGAATCAGT 
115351 TTCAATTCCA TAA1" IGCIGT AAATATTATG TAGCATTTAT TATCOCATTT 
115401 TTTGATCTAC GATCTAAATC TAAAAGATAC AGGTOCGCTC OCT0GGCTGA 
115451 AGAGTTTATT CTOCTAGACA TAAAATGAGT GTACATGOCT AGGCATTGTC 
115501 CACC CC i nC CAGTTOGATC ATCTCTATCT GATCAGGAAG GACAGAGATC 
115551 AGAGAATCAA AATCATTATA AAOGAATACA GA7GATCACT GGGTGACTGA 
115601 AATCTTATCT GCCAAAGAGC ATAGTTAACT GCAGCAGCAG TCGATGATAA 
115651 CTACTOQGTG GGGGTAGAGG GOTTGTTTTG CTCAGTTCTG CCTAGAAGAC 
115701 AGTTGTAGTT ATTTTAGTCC GAOVGTCTCT ACTCCTCGCT GGGCCTCTTT 
115751 TTQCOCTGCT I ICJLIG G CCT CTOTATAOOG CLI I Ibl IAG CALI I IGIAG 
115801 TTCTCAGTCA GCTCTTACTC CAAGTOCTCT GTCAGAAATA AAGTTACTTT 

115851 nsnrs MGA gtctocagta atocccctpc i i i i iaaact atgactccoc 
115901 aaagatatat agtctaaot gtgtgcagca gtattttatc atatttattt 
115951 aataaattcc ataaaoctta cataaaagat attaaacaaa aagtactcac 
116001 OOCCrrAAAA GGAQGAAGAT aatcagcatc aagqcatogc caaattacac 
116051 AGTCTTAOGA AACAOTACAG TAGOTAATTA TTCGACCTTT TTCATTAATG 
116101 GC ATO G PGC TTTGCAGGTG TCTACCACCA CTOCACTTCA CAGGTTTCTG 
116151 TAAATAAAGT GATTTOGQCT TCGU ICIW TCAGCATTCT TCTATCGTGA 
116201 AAOCATTTOG TGATGAGAAG CTTGAOTTTC TAATOCATCT TCGTTOQCTT 
116251 TATTGGAGCT G LI 1 1 1 G TA C TCTGGGCACC CATTTAACTT OCTCTGTOOT 
116301 TCATATATGA GAGCTAGAGG ACTOGOCTT AGCTTCTTCA TCAGAACAOC 
116351 ATCTGATCTC TOCAAATCTC TTACACTGTG GCTCGGCTCT AAGAGAGGTG 
116401 GAGAGGAGAG AGGGCAAGGC TCAGQOCATT GCAGTGAGAT GATCACTTTC 
116451 TTAGQCTACT GGCTGAGAGC TCTCAAACAG GATUG1CTG TCGGCTCTAG 
116501 7GAGQQGAAA GTAACOQGOC TTCAGTTCGC TOJPCAGAAA CAGTTA00CA 
116551 OGGCOGGGTC TGGTGGCRCA TCOCTGTAAT TCCAGCACTT TGGGAGGTCA 
116601 AGGGTGGAGG ATTGCTTAAG COCAGCAGTT TCAGACEAGC CTCQGCAACA 
116651 TAAGAGGAGC ATGTCACTAC AAAAATCATT TAAAAATTAG CTGGATCTGG 
116701 TCGCATCTAC CTCTGGTOGC A9OTACTCGG GAGGCTGAGG TAGGAGGATT 
116751 GTTTAAGGQC AGGAGGTCGA GGCTGCAGTG AGCTGTGATT GGGGCAGTGC 
116801 ACTCAAGGCT GGGGAAGAGA GTGAAACTCT GTCTCAAGAA GAAAAAATCT 
116851 TTCAGGCACA GTCAGTCTTG AAGATCTCAA TTAGCAAGGT I 1 1 I I IAATC 
116901 CCCTGGAAGT GTX3CAACCCA GGATTAAACA GATG1 1 IAAG AACATAAATA 
116951 AAAGATGAAT TCCFGG00GA GCGGGGTGGC TCACACCTGT AATOCCAGCA 
117001 OCTTOGGAGG CTGAGGCAAG TGGATCATGA GGTCAGGAGT TOGAGACCAG 
117051 GCTGAOCAAC ATGGTGAAAC GOCGTCTCTA CTAAAGATAC AAAAATAAGC 
117101 GGGGGGTGGT GGCGGGCACC TCTAATTOCA GCTACTCAGG AGGCTGAGGC 
117151 AGGGGAATTG CTTCAAGGGG GGAGGCGGAG GTTGCAGTGA GCTGAGATCA 
117201 TGOCACTGCA CTQCAGCCTG GGTGACAGAG TCAGACTOCA TCAAAAAAAA 
117251 AAAAAAAAAA AGATGAATTC CTGACACTTA ATACATTTAG ATAAAGTCTA 
117301 ACTT A1U I I AAAGTTAACT TTAAACAACT GOXTACAGC ATAATATTAT 

117351 eeeoGTXjTA gagagtgagc tctagtataa gctttaaatt ggtttaaaaa 

117401 ATAGATTAAG TATCAAAAGA 7CTCTTCAAC ATTTTCATAT CTCTACAACT 
117451 CTTTATTAGA AATATOCATT TTCTOCTATA TTCAU I IU GCTGAACTAA 
117501 AGTCCATGOC AGTCTO CA GA GGATGAGAGA TCGGCAGAAA TTTATAAAAC 
117551 AGCAGCAGTC ATGGAGAGQC TAGGOCAGAA GTAGAACATA GCAGAGCPOC 
117601 TGGGTACTAC TCTCTCAGTC AGAAAGGATC CAGAGAT0CA GCATCAGCAG 
117651 ATTCCCTCAC ATACTGAGCA AAGAAACTTT TTTAGT TTGC AGATTTTOGT 
117701 GGGAATAGGG AGGAGTATTA GAAGCTGAAG TTCTTCCTTT ATTTAAATAA 
117751 TCA7GGCAGT AAGAAAATTC GGTTAAATAA TGGCGLI IU AGGOCAGGQG 
117801 TGGTGGCTCA TGGACTTTAG GAAGGQGAAG CAGGCAGATC ACTTCAGGTC 
117851 AGGAGTTTTG AGACCAGTCT GGCCAACTTG GTCAAACTTC ATCTCTACTA 
117901 AAAATACAAA AATTAGCTGG GOGTGCTGGT GCACAGCTCT AATO0CAG0C 
117951 ACTUGGAGG CTGAGGGAGG AGAATOGCTT GAAOCAGOTA GGTGGAGGTT 
118001 GCAGTGAGCT GAGATCACAC CACTGCACTC CAGCCTGGGC GAGAGCGAAA 
118051 GGGCATCTCA AATAAATAAA TAAATAAATA AATAAATAAA TAAAATAATC 
118101 CCOCTTCTAG TTGAAAAACT AAGTTOCTAC CTAAGCATAA TTTGGATTTA 
118151 CGCAATTTAT CTTCTTTCAA AATACCTCAA ACATTTCAGC TTATTATTCT 
118201 TTTTAAGGAT TACAAAGTAG AGCAGGGGGG AAATAATAAA CCACTAATAA 
118251 AGAATAATAG CCATTTGACA GACAGGTCTT CTTAGTTTCA T A AAAAAAAA 
118301 GATCGCTGGT AGATTCAGTC TTTTATGAAT ACTAAAGAAT GOCTCTTATT 
118351 nTCTTTCn GGAGAGAGGT TCATTTTGAA CCTAACCP3G IGI IUCAGG 
118401 GAACATAGGT TAGAGGGGAG AGATTTAGGA GTGAGGGCTC AAGCAAGAAG 
118451 CATGTTAGAA GALIGCTCTA GTGGTCGOGG 7GAGGAGTGA AAGGAATOGA 
118501 ACTAAAATAC CATGAGGAAC GTGGGCCAAA GGAAACAAGT CTGAGAGATT 
118551 TAGAAAGTAA ATCATCAGGA TTCAGTGGTG ACTCTTGGAT TTGAAGAGGG 
118601 AAAGGGAATA ATCTAGGGTA GCTCTCAGTT TCTCCCATOG GTAGTTGGGC 
118651 TAACAGTAGT GTATTAACTG AAATOGGGGG CAGGCAATTT GTGAGGGTGA 
118701 GTTGAGTTCA GCAGAGGGCC TCTTGACTTT GAGGTCGCTT TCAAACAGGA 
118751 GTGGATATCT CTAATTTTAC ATGAAGTCTC TACTTAAGAG AATCTGTAGA 
118801 GAGATTAACA GGGCTGGGTA GGAACAOGGA ATAGCTCAGG TGGCAAATGA 
118851 AAACTTTTOG CAATAACAAG AAGTTAGGAA GT7GAAGAGG GTAGAAGAGG 
118901 CAGCAGAAAT GTAGATGGAT AAGAACATGT TOGC ATC1 IT AAGTGAGATT 
118951 GTGAATAAAC CTGTCCTACr GGGGCAGATG GGATTAGGGC TGAGATCAAA 
119001 GCAQQCCTAA TCCAGTTTCC AGAGGTCTTl 7GTTAGACTT T7CTAGAGCT 
119051 GGATCACACA GTUTOCTAAA TTTAAATTAG GTGGCAAGAT TTCCAGAGAA 
119101 AATCOGGATT TCTGACTTTT C I 1 1 I A AACT AAAGGGCTCC TAGAGGTAGA 
119151 TTTCGCAACA TTOGTTAGAGC TATATCATAA TAATCGACTG AAGTATATCT 
119201 C C l MC I CrC CAATCAAOCT GTTTTAOTA TCTTATTTGG CTGAGCGCTT 
119251 AGAGATTTAA AI I TIG I ACT I 1 1 1 1 1 1 1 I I AATCCAGGCT TATAAGTCAG 
119301 ATCAATTTTC TAG I ICIG4G TCAAAGATCA CTAGGTAATA AAGGTACAAA 
119351 GXTAGATTAG CAACAGATTA GGGAGAGCTT IGAATAGCAA TCTAAGGAGT 
119401 GTAAATCTAG GCAGTOGGCC AGCTTTGAAT AAGGAATTCA 7GAGATTAAA 
119451 GCCATATTTA GGAGGATTAT TCTGGAOCAG TATCAAAACA CAGAAGTTAG 
119501 GGAAAACAGT TAATACTTTT GAAAGAGAAG AGAAAAAGGA GATCGTGTTC 
119551 GGATAGATAA ATOQGCTTTT AAAATGCAAA A7GAGAAGTG TTTTAAAGAG 
119601 ATATCAOOCA GAAAGTCTAT GCACTGOCAC ATOGGCACTA TATGGGTGGT 
119651 TCTATTTOGT QGGAAATTTC CTTOCAGACT TOCAGAACTC AGAGCAATCT 
119701 G1GGIGIGGG GGACGG7GAT TCTCAQGCAT TATCGAAAGG TCAAACAAAA 
119751 TATGCTGACT GGCTATCTAT GGOGCACAOG TCACTCTAOT CTCTUTTATA 
119801 AGTAGACTAA GTGGAGGAGA AAGGTOG I I AAAAAAAAGA AAGCTAAAAT 
119851 TAATAOCTCA I IGI IATTAA CTGTGTGCCA AACACTC1 PC TAAGCTCTTT 
119901 ACACAGACAT TTTATTTAAT OCTTDGCAACC AATTTOGAA GTAGGTACTT 
119951 7TOCAATTTC CATTTTACAG ACAAAGAAAC 7GAAAGCCTA GAGG1TAAGA 
120001 AOTTATOCAA AGOCACAAGG CTGATAAGAA CAGAACCAGG ACTTCAACGC 
120051 AAGCAGTCTG OCP C I O CAGA GGTTTATCTT TTAACTGCTA TCTTAAACTG 
120101 OQOCTOCATT TTAATCTCTT CTAATGCTAC ACAGATAGGC AACTTTACAG 
120151 GTAGAGGAOC TTATCCTTTA TTCTOGATGC TCTGTTATAA CT0G1 I ICAG 
120201 GGGTGTCAAT TTGGTCCAGG TCC1CCTGGA AGAAATAAAA CFQCAGAATT 
120251 GACCCT7GAA CTCTCTCTTA GGGAGGAGAT AATCTAAOGG GTOC1 PGGGT 
120301 GAOCTTCAGA GAACAGGTAT GTTCAAATGT CIGI ICICI I COTTAGCTA 
120351 ATGGATOVGT GTAGCTTATA ATTCCATCCT TCTAACCCTT TGTTCAAAAA 
120401 TAAAAACTCT TATAAACATC <_l 1 1 1 1 1 1 1 1 TTTTTOGAGA OGGAGTCTTC 
120451 CTCTGTOGGC CAGGCTGGAG TCCAGI I GIG CAATCAGCTC ACPQCAAOCT 
120501 CTGGCTCGCG GGTTCAAGCT AI ICICCIGC CTCAGGCTOC GGAGTAGCTG 
120551 GGATTACAGG CATCCGTCAC CACGCCTGGC TAAI 1 1 MAT AM I I I AGFA 
120601 GAGATCAGGT TTCAGCAGGT TGGOCAGGCT GGTCTOGAAC TCCTGGGCTC 
120651 AGGTGATCCA CGCAGCTCGA C3CTC30 CAAAA TGCTOGGATT ACAGGOGTGA 
120701 GOCAOCATCC GCGGOCTTAA AAATCCI I 1 1 AAAAATGAAA ACTAAAACAT 
120751 OTTAAI MM TTCAAATCTT TTCATGAAAA TTATCAGAGG AGAAGTTTCA 
120801 TAAATATTGA AATTTOGAAA AAGTTCCAAG CXTATAACAT TGGAGAGAAG 
120851 CAAATGCATT TCATCCAAAG GCTCAAATTT GTCAAGI 1 1 1 TCTAGCATAT 
120901 TCAGTGTGGT nCTTTC T CT TTGGCCTATA GATCAAACAT GTAAATCAAA 
120951 GATTTCAAGA TCAAAAAAAT AAAGAGGTTC TTCTCATGTG CATTGGGGTC 
121001 ACTPCAGGAG TTOGAGGACT GCICII IGGC CGGATTCCAG ATTATGTGOC 
121051 TCGTGTGAAG AAGGTTTATC TAGAGGTACT I I I MAOVZC Mill ROGOC 
121101 TATOVAAAAT TACTCTCATC AO0CAATCTC TCATTAAA7G TACTTAGATC 
121151 CTTAAATTCT Ml Ml MCI I HOG I 1 1 PC 1 1 1 1 IGAGAT GGAGI I PQGC 
121201 TCTTATTGCC CAGGCTGGAG TGCAATGGCA CGATCTCAGC TCFOCGCAAC 
121251 CTTXACCTGC GGGGTTGAAG GGATTCTOCT GCCTTAGGCX CCCAGGTAGC 
121301 TOGGATTACA GGOGTGTGOC ACCACAOCAG GCTAAI II II GTAI 1 1 II II 
121351 AOTAGAGATC GGGTTTCTOC ATCTTGGTCA GGCTGGTCTT GAACTCCTGA 
121401 GCTCAGGTGA TCTCGOOGCC TOGGGCTCCC AAAGTGGCTT AAAI ICMCI 
121451 ATAAAAATCA GAAATATTTT CTACAACATA AC I ICIATAG GGA G I I M I C 
121501 AAGGACAAAA TTAGTTATTA GTTTGGGTTT TAAACATGAG AAATTOGCAA 
121551 TCAAACAACA AM I CI I I GI TTTCTOGTGG AACTXXAGCA AAGCAGAATC 
121601 Gl I I ICATOC AMGCMIII CTATGAAGAA IGI MM IGG TCTAGI ICIC 
121651 ATAGTCATCT GCAGATCCTG TGGGCTTTOC AIGICMATC AAATTTOGTT 
121701 GTGTCTGTGA CI 1 1 ICAGCT TCTTTAGPGC AAATTGOCTC CICGIGI 1 1 1 
121751 GGGGTGAGCA TAAACAAATC CTAATTOCAA GATCATTGCT GACAATCAAC 
121801 AGAACAGGTA TTGAAGTGAC TOCTTCATOC CACACACTCT GCTAAATCCT 
121851 GAAGACTTAA GTGAAACATC GTCTCTGGCC TCAGCTAGTA GCTAA7GGTC 
121901 TCATGGGAGA AGATAAAGCA GIGI IGCAGC AGAGTGGAAT AAOGGGTTTC 
121951 CTAGATCAGT GOCATAGCAA CACAGGGAGC ATGCATCTGA GTCACTAGTG 
122001 AAGGGCTGAG CAGAGTTAAG AGGTGAGGAC CAAGTGTGTA GATAAGGGAG 
122051 ACAGAGAAGC ATOCTOOGAG CAGAGACAAA CACATGGGAA ACACCTOCCA 
122101 CACTCTGGAG GTGTGACATC GGATCTTATA TCTGGGAAAC AAAIG1 1 1GA 
122151 ATGGAATAAA GGGAGAATAG TOGATOGATT GGTTOGGGGG ATOGACAGGA 
122201 AGCAGCTTGG GAAGGGOGGT TATCTACATC AGCTCTATCC TGCAATCTAC 
122251 TAGGAGCTAT TAAAGGATTT TAAGCC AGAG AGTGACATAA ATTGGGGTGG 
122301 CGGGGGTTCG TGI MM IGG I II II IGAGG CAAGGTGTCA CTOGGTTTOC 
122351 CAGGCTGGAG TACAGIGGTC CCATCAOVQC TOVCTGCAGC CTCAACATOC 
122401 GGGGCTCAAG CAATCCTCGC ACCTCAAACT CCTCAGTAGA TCGGACTA CA 
122451 G GTO O STO OC ACCATOXTG GCTAAI 1 1 1 1 GTTTTTTTCT I 1 1 1 1 1 ICIG 
122501 TAGTAACAGG GTTTOGCTOT GTTGCOCAGG UGblCTQ G A ACTOCTGGAC 
122551 7XXAAGGATC CAOGCATTTC AGGCTOOCAA AGTX5CTGGOT TTACAGGTCT 
122601 GAGGCATGGC CAGCCTGACT TCTGTTTTAT AAAGATCAAT TTCGQCTCTC 
122651 CGTOGTOGCT CATCCCTCTA ATCGCAGCAC TTTCGGAGGC TGAGGCAGGT 
122701 AGATCACCTG AGGCCAAGAG TTTCAGATTA GACATGGTGA AAOCCOGTCT 
122751 CTACTAAAAA TACAAAAATT ATOCAQGCAT GGTOQCACAT TCCTCTAATC 
122801 CCAGCTACTC AGCAGGCTGA AGCAGGAGAA TCACTTGAAC CTGAGAGGCA 
122851 GAGGTTACAG TCAGCTGAGA TTGTCOCACT GCACTXXAGC CTGGGTGACA 
122901 GAGCAAGACT GCATCTCAGA GAAAAAAAAA AAATCAATCT GGTAACTCTG 
122951 AAGAGTGGAA TCGAGTGGAG GAGGGGACAA GAGTTGAGAT AGTCACAAAA 
123001 GGOCACATAC TGAATCATTC CATTTATATC AAGTCTGCAG AATAGGAATC 
123051 TCTGTCTCTT TOCATAGAGA AAGAAGGTAG ATTTCTGGTT GTAGQQGCTA 
123101 GGGAAAGGAG GGAATGGGGA GTAACTGCTA CTGTCTTTCI TTCTGGGGTA 
123151 ATCAAAATCT TATAAAATTA GATAGTCATC AAACTCACTC AATTCTGCAC 
123201 TTTAAGTGGG TGAATTTCAT AGAATATCAA TTATATCTCA ATAAAGCTAT 
123251 TTTTAAAAAG GGCAGGCATC GCAGCTCATC GGCATAATAT CAACACTTTA 
123301 GGAGGCTGAA GCAGGAGGAT CACTTCAGGC CAGTTOGAGA CAATCCTGGA 
123351 ACCATAGTGA GACCTCATCT CTAOCAAAAA AAAAAAATTT TTTTAAGTTA 
123401 GGCAGGCAGG GTGGCACATG GCTATAGTCT TAGCTATCCA GGAGGCTGAG 
123451 GTGGGAGGAC TTTCAGOCTA GGACTTTCAG ATTACAGGGG AGTATOGTCA 
123501 TGOCACTCTA CTOCAGOCTA GGTCACAGAG TCAOAOCATC TCTOAAAAT 
123551 TTAGAGAAAA AGAGTAGAGG TOCAGGGACT AGTTGGAAAC TATATTAAAG 
123601 TAGTCTAGGC ATCTAGGCAG GAAGAAOCAA GGCAGAGATA CTGAGATCAA 
123651 ATOGAATAAG CTAGGGACAT AGAGAATCAG AATTTCATCA AGCCAC TTAG 
123701 GCPGGATCTG AGATTTCTGG CTCAGGTAAT GGAGTAGATG ATTTTGOCAT 
123751 CTTATCAAGA TCAATATCCA GTAGATOAA GTATGATCAT GAATTOGGGT 
123801 TTTTTAAATT AAAAAAAAAT TTTCAOOCAA OCTAC3CTO0C CTTGCGXAG 
123851 TCATCAGTTT GUM IGCCA PS1 l b I I PG1 GATGGCTCTA GGAAAGOCAA 
123901 GTAGIGI I IA GAGCTCATOG AAGAGTCTGG ATAAGAGGGC TCTGGGTTAA 
123951 GAGGOCTGTG GGTTAGGAAG OCTGTGGGTT TOGCATTCAT GGGCTGCATC 
124001 AAGTAAGTGG AGGACTAAGG AGATGGAAAG GGAGTGGOCA GATAGGCAGG 
124051 GGGAAAACAG GGATGAGAGT Gl I ILIAGGG GCAGGAGTGG TCAGCAGTOT 
124101 CAGTAGCYAG TCACAAAGAG GCTGAAAGGT TTTCAT7GAA TTTAGAATAG 
124151 GGAGGOATC AGTCAGCTTA CAGAGAATCG TTTCTACAGG TCGAGAAAAT 
124201 TCATCATGAT CATTTGAACA AA7GGAAATA TAGACAACTT TCTCCAAATC 
124251 CTTGGCTCTG GAAGGAAGGT AAAGGCAAGC CACAAGGGGG GGTCTTAGOT 
124301 TTTCAGGATT AAGO CIGICT AATTATATTA TACATCAGAG GCAGTATTTG 
124351 AGCAGGGCCT TGAAAAAGAG GTGAAATTTG GACAGAGGAA AATCATTGGA 
124401 AAAGGCATTA CAGATAAAGT TAACTTOGAT TAATTCACTT GAAGTAATAA 
124451 CAGTCAACCA TTTATTCAGG AC I I ICATCT GGGAGAGAAT GAACTAAGGG 
124501 CTOACATGC ATTATCTCAT TCATCCTPGC GGGAGGCCTT TAAGAGAGAA 
124551 GGTAGCATTG TTATTTCCAC TTAGAGATCT GAACACTGAG GAACTGAGAG 
124601 GCTAOCrcAC TCTOGOTGTC TCTCTOTATA GAGGTOCAGG CAGTCAGAGG 
124651 ACAGTCP GG T CACACAGCTA GAAGGAAGTG AAGAGAGFTG GTAATCTOCT 
124701 GCTTTTAAAA ATCTATTTAT TCTATCATCA TCACTTTCTC AGTACTTCAC 
124751 TCTGGAOATC CTCATCTAAC ATTTAGI 1 1 1 Gl IUUAOT GTCAAGGGAG 
124801 CXXTOAATG GAGATTTATT GCACTACAGA CTTTCAGOT TCATACATTC 
124851 AAAAATTGAG TGOLKXIGT GOOCCAAGCA CCAGCPCAGA TCUGI IAGG 
124901 GTCATCCAAA CAAGAGAGAC ATQGTOOCTG AGTTOCTAAA GCAAGOCTGA 
124951 GGCAGGAAGA AGOGGAATCT GTGTGGAAAC GCAAGAAGAT GGGGAAAGTG 
125001 GCATOGGAAG GGACTGGAAA GTTAGAGR3G GTCAGATTAG ACAGAGOGT 
125051 GAAAGGCAGG ATGAAGAGAC I I IGL I I I A A GAGTAGTGGA TTTTGGOCGG 
125101 GGGCAGTOGT TCAOQOCTGT AAOGGCAGCA CTTTGAGAGG CCAAGGCTQG 
125151 GGGATCAOGA GGPCAGGAGA TOGAAACACA GTGAAAOQQC ATCTOACTA 
125201 TAATTACAAA AAATTAGCCA GGCTTGGTGG CAOGOCjTXTG TAGTO0CAGC 
125251 TACTGGGGAG GCTGAGGGAG GAGAGTOGCT TGAATCPGGG AGGCAGAGGT 
125301 TCCAGTGAGC CGAGATOGCA CTACTGCACT OCAGOCTGGG TGACAGAGTG 
125351 AGACPOOGTC TCAAAAAAAA AAAAAAAAGT AGTGGACCTT IGI PGGAGGA 
125401 GGTAGGTCTG ATTT7GTTTT TAAAGTCACT GTCGTTCTTC TGTCTTTGGA 
125451 GGATGGATTA CAGGGAGGGA AACAAGAGTG CAAGAGGGGA GGAAGAOCTG 
125501 TTAGGAGATC AATTOVGGGT CCAGGTCAGA GATCATOGAC TATGGTGCTG 
125551 TAATAGATAA AGTCATGATA TATTTTATAT ATXZAGCATTT TATTCCAAAA 
125601 CAATCTTTGA AOGTOCP0CT ACOCTAGATA GGGCAGACAG ATTATCTGCA 
125651 AOATTTTTOG AGGACATTTT AATAOCTCAC IGI I ICCAOT AATTTAlCAAA 
125701 AGAAAATATA GCCTTTCCTA AO IGIOGCA IGI IGOTCTG CAGTTACACA 
125751 GCAGTAAGTT AAAAGTTAGT ATTOGGGGTC AAATATTTCA CTTTAGATCA 
125801 AACTTTAGCC ACAA TCTOGC TTCKHTAGG GCTTATCTAA 1 1 I I IGCATC 
125851 CAAATCTAGA GCATOJTTTG TCGCAGOCAG TAGCACATGC TCAGTCACAG 
125901 GTGTOAGAGC TCGATTTCAA AGAAGGCTGA GAAGGAGAAA GAAAGCGCIT 
125951 CAGVGVGTCC TCTCGTTCOG AGGAGGCACT CAOGGACTOC AjGCTTCTCAA 
126001 GAOVGGGGCA CAGGAGGCAA CACAQQOCTA ACGCATGCAG GATGCTGGAC 
126051 T0CTTO0GT ATTCACTAOC TOCTCTCKT OLI I 1 1 1 CAT GGC1 PGCTTC 
126101 CCCCAACATC (XAAACACAC AGTCTGGTTT TTGATTCTTG GCTCTTCPCT 
126151 GCTCCAGTTC ACT CGAGC TC TCGCTGTTTC I l O L ML Cl I I I ILI CXA TC 
126201 GCTOGCTTCn: TCOLI I I IGG COCATQCTTT AAGGCTPGGA ATCCTOCTGG 
126251 GCTTAOCmC TTTCCATTCA TTCG TG GTCT GTTTTCTCAG ACATACCTGC 
126301 TCCCTGCTTT CAICI I ILAA CTTCTTOGG G CTGTGACAAC TCTTCOCTCT 
126351 TKSTOOOCTC GAAGCCAGTT CPGAGTAGCA GGCAGGCCTA GAACACTGGT 
126401 GACACAGACA CACTTCATAG COCPOCOQCA TGOTCTAGIT TCAGATCATC 
126451 CTAATCOCTA GTCXAGGAGG CTGGGGAGGC CAAQVQCACA GGCTCTCGAG 
126501 TCAGAAGOGA GTTCAGCCGA CTTCTACCAC CAACTTGAGA ATCAGGAGAG 
126551 GGCTGTGGTG AGGATTCTTC GTGGCAGGTT ATGTAAAGTC CCTAGCCCAG 
126601 ACTOGATCAT TAATAAATCC TTCCATCTOT TATCTTTTAA TATCTTATTA 
126651 AATACTGAAA GOVGATCCTC ATTTCGAATA GGTOOVAGA AAQGAGACTA 
126701 GGGTCTAATC CTGAATTAGA GTC1 I PQC1 I ATAGGTAACA AAGAATTTAT 
126751 GAATTTATCT CAGATTTTTG CTTAAGAGTT CTGAAT7TAA ACTTCCATCA 
126801 AG GTCCTO GG TCOGAGGTGT TTTCAAOCTA ATAATTCTAA ATATTCAGTG 
126851 GTTOGTTCCA GTAQCTTATG CCTATAATCC CAGCGCTTTC GGAGGGCAAG 
126901 GCAGAAGGAT OCCTTCAGOC CAGGAATCCA AGAGGAGGCT GGGCAACATA 
126951 GTGAGAGGPC ATCTXTAGAA AAAATTAAAA AGTTAGCTGG ACATGGTGGC 
127001 CTACAOCTGT AGTGGGAGCT ACTTCGGAGA TTCAGATOGG AAGATTOCTT 
127051 GAGCTGGGGA GGTTCAGGCT GCAGTGAGCT GTGATCATCC CACR3CATAC 
127101 CAGTCTQQGT GGGAGAGTAA GA L 1 1 IGICT CAAAAAAATA AAAAAATAAA 
127151 AAAAAATGCT GAGTCAAGTC TACTGCTCCT GGCAGAAGAG ATCACTGAAG 
127201 TGCATTAOGT AGAATAATAA TGOTTAAAGA AAAGCTTTGC AAAAGTTCOC 
127251 AGAATATATA CTTTATTOGG AGAGGAGAAG CTAOGTCTUl GGTGGTATCT 
127301 TTTTACTATT I ILI IAATCT TATAGGOCTG TCTT7CTAGT CAOCATTAAA 
127351 TTACTACAGA TTTGTGTTTT TAATGTAATA TATAAGTGTT TTGGAAGGGT 
127401 GAGAATATTT CAAAGGTTTC AAAGTTAAAA CTG1GIATCA AAGAATTGAA 
127451 AACTTGGAAT TTAGATCACT TTTCCATTGT GTCATATTTC TCTOTGACAT 
127501 GGACATATTG AAGCATOGAC ATCATGTC3CA TGCACATAAA GCAGACAACC 
127551 CAGACAACAC ACACATCTOC ACAGGAAOGC TCTOGAAGGA TCCAAAGGGA 
127601 ACTGTTAAGT G77GATTOCT GAGGAGGTT5A ATCGGCA7GG GTTCAGAGTG 
127651 AGAAGAGCTT GTAQGAAGAC TTTCATATAT TACTOTOT A C AI MUC TAA 
127701 GGTTTGAATT TTAAAAAATA TATTCATGAG TTACTTTTCT AATCTAGAAA 
127751 TATTAATGAC TTTOCTATOC ATCAGTCTGC CTAAGCTTOC ICI PGGGGTT 
127801 CAGGTAGAAT GAATTOGATC AGIGI ILCTC CATTTTOCAT TTTAGCATTT 
127851 TACATTTGOC TAAAGATATC TTCGGATCAG GGTATATACT TTATCAAATC 
127901 TAAGCTATTT CCAAAGTAGV AAATOCAAAT ACTTAACAAC TTOGCAGGCT 
127951 AAGTAAATCA TCAGAGGCTC CGTACTCAGT CTCATCTAGA CTGTGGCACT 
128001 GGGTGTGAAC GTATCAAATC CAIGI I ILIL CATCAGGCAG AAGTGAGACT 
128051 AACCATCTGC CATGGAGAAG GTTGACAGAC TCGCTGTCAA GCACTTOGAA 
128101 GTGACACTCG GCTOGTGTG CTTCAGAAGA ATOCAGGCAC CTGCTUTGTC 
128151 GGCTGACATT I I (JL I II ACT TTCTGATOGG GGAGCAGAAC PCTCTTOOCA 
128201 ALIGI I I ILI GTOCTGGGTG GCTAGCCAGA G G1 PCTGAAA GTCTOGAGAC 
128251 TTTATATTGG CTAAACTTTA GGAAGGTCAA TTACATCTCT ATCTOCAAGA 
128301 TOOLIILIII TATTCAGGTG CAGCTCATTG TTTC LILI IG AGCTACACTT 
128351 AAGATTCTTC AGCAAAAGCT AAACTGACAT TT CT OC A GCA ATCCTCTOCT 
128401 TGAGATAGAA ATOGGAAAAG TAAGAGGAAA AGGAATCTTT ICI I LI CATC 
128451 TCCATACACT AACTCATAGA AGGTTAATAC TTCTATAGGC TCTACTATTA 
128501 TAACAACTAT TATATATTTA TGATATATTT OCTTAAAGAA AACAAAAGCA 
128551 ATATAGACAT CTAAACTGTC ACTGGCTTAT TAAGTGTCAG TCGCAGAGOC 
128601 TAGGAGAAAA TAAGGAGGCT GTGAATTOCT TACTOGAATC TAAOCAGAGC 
128651 TGCIGIGI 1 1 GAGAGCAAGT TTTAAAAGAT TCTATCTAAT ACTAAGTTTA 
128701 TTCATCTTTC AGACTGAGTC CXAGCATCAC CAGATCAGTA TTTCATCGCT 
128751 GGATCAATCT TTATTCTOGG GAGTGATGAA GCATTCAAOC TCCTATATOT 
128801 ATAGTTTOOC GAGGGTOGGC AIGIGCIOCT TCTGGQGCAG GCATOGCTOC 
128851 ATATAAGGAA TAGGTAOGTT CTCAOGAGGC TCAOCTACTT AOCTOCACAT 
128901 TTAGGCAGAT TCTOGGTATT AAGATCTCCT GGGAAAGAGC ATCACTACAG 
128951 TAGCTACAAA TAAGGTGGAA GAAGCAAAGT ATTTTTCTGA GAAGTACTTA 
129001 AAGAATAGAT GTOTAAATTT CTATAAACAC AAGTCTTAAA GGAAAATCAA 
129051 AAAATT7TAC ATTTAAATAA CTACATAAAT CATTOGCTAA TTTTAATAAG 
129101 AATATAACTT AATATAGCTT GAATCGAGAA AAGGACAACT TCCAGTCAGG 
129151 GAAAGTATTA AGAAATAATA TOCTCAGTCT GGGCGOGGTG GCTCATCGCT 
129201 GTAATCGGAG CACTTTOGGA GGGGGAGGCA GGCAGATGAC GAGGTCAAGA 
129251 GATCAAGAGC ATOCTOGCTA ACACAGTGAA GGOOGATCTC TACTAAAAAT 
129301 ACAAAAAAGT AGOCAGGGGT GGTAGK3GGC GCCTATAGTC CCAGCTACnC 
129351 GGGAGGCTCA GGCAGGAAAA RSI PGTCAAC CTOGGAGGCA GAICI PGGAG 
129401 TCAGTOGAGA TOGOGGCACT GCACTOCAGC CTOGGTGACA GAGCAAGACT 
129451 GCATCTCAAA AAAATAAAAA AAATAAAAAA AAGAAATATT ATGCTCAAAA 
129501 TATATAGCAA TAAGTTGG^A ACT 1 1 IACTT GAATAATTTT TACAAAACTG 
129551 AGCAAAGAAC AAAAACGTGA AGAGGGCAAG TPOCAAGACT CATCTTATCT 
129601 AAATTCTTCT AAAACAACAT TAGCACTTAA CAAAGTTCAA AAGTTAACAA 
129651 AGGCAAGTAC TGTACTAGGC TTCCAACACT AACTAAGTAT AAAATTCCAC 
129701 AGAGCTOGTT MC I I A TCTT TAAAGAAATT IGI IGGCAAG TCGTACTGGT 
129751 GTTAAAAAAA AAA/WWWV AGGAAATATC TACTCAGCAA AATAGAAAAA 
129801 AAATATGAAG CACATTAAAA GAAAAAAATA TATATTCCTG AAAAGLI IGI 
129851 ATAATTACAG TQGCATOGTT GGGAAIGI 1 1 GGTCTATAGT TTTAACAATT 
129901 AAATCCATTG AATCTOGOOC GGTAGCATCC TAAAGTTTTA TTCTAGATTC 
129951 7CTOG4CTTT GTCATTATAG ATATCTTTCT AAGATTTAAG TAACTTTCCA 
130001 TCTTTATCTC CTTTTATCTT TCTACATAGA ATAAAAATOT TTCTAI IGM 
130051 AAGAATATTA GAGTTQGACG CAGTOGCTCA GGGCTATAAT GGCAGCACTT 
130101 TCGGAGGGCA AAGCAAOTAG 1 1 IGI I PGAG OOCGGGAGTT CAAGAATGGC 
130151 CPGGGCAACA TAGTAAGAOI GCAICIUAC AAAAAATAAA AAATTAGCGG 
130201 GGCATAGTOG CATCTOOCAG CTACTTOGGA GACTAAAGTG GGAGGATCAC 
130251 TTTCAGCXXA GGAGGTTCAG GCTCCAGTCA GCTATCATOG CACCACTCCA 
130301 TTOCAGOCTC GGCAAGAAAG TCAGAGCTOT TTCAAAAATA AAAAATTOGG 
130351 GTTTATCTAC TTAGAI 1 1 IC AATAAAAATT ACTACTTAAA ICI I I AGOG 
130401 CTTGTTAATT TCAAACGCTT TTOACATTT TGATTTATCT TTAAATCTCT 
130451 1 1 1 1GICI CA ATAAATOGGA AGTATCAGGA AG I CI 1 1 1 IA CI IGULAAG 
130501 GTCATAG4GA GCTTAGAACC TGGTAGTOTC OCTCTCAGGC GCAGI ICI I I 
130551 CCAAOCTOCC AGGCTCTAGG GOCAACAATT ACTCAGCACT AAGAAATTAT 
130601 GL I 1G IGC IG TCATGGCAOT TCCATPGGAG AAAAGGATAT TTAACTGGCA 
130651 AACAAAAGTC AGGAGAATGG GGAGATTTTC I ICI 1 1 IGAA ATCCTAGTGT 
130701 GAAGTOCTAG GCTTAI I 1 1 1 CAAATOGGCA ACTOGTATTC I I I ICI 1 1 IC 
130751 I I I I I ■ I I I I GAGAGGGAOT CTCACACTOT CGCOCAGGCT GGAGTCCAGT 
130801 GGGGGGATCT GGGCTCACTG CAAGCTOQGC CIGUGGGTT GAGGGCATTC 
130851 TCCTOCCTCA GCCTCOGACT AACRGGGACT ACAGGGGGCC ACCACCAOGC 
130901 OOGGCTAACT 1 1 1 1 1 i I I I I 1 1 I I IGI ATT TTTAGTAGAG AOGGGGTTTC 
130951 A OJGIG I I A G OCAGGATCAT CTTGATCTCC TCAGCTOGTC ATCOGOCCTC 
131001 CTCAGGCTXjC CAAACTGCTC GGATTACAGG OGTOVGCSCAC GGGGOOCAGC 
131051 GGGCCAACTC GTATTOOAA AQGAATCATA ATTTTAGCAT AAGAGCATAG 
131101 7TTAGTGATT GAAGAAAAAA TCTAGOGAAC TCTATCATAT GATOGTX7TCA 
131151 AAAAGAACTA AOOCAATATC AAACAGI 1 1 1 CAGGAGCATC TTTCCTATTT 
131201 TOGTCTCAGT GQVOCACTTG TCTTAAAGTT GTOAAAAGCA ACACTCCTGA 
131251 ATTCCAOGCA GAACTCACAC TCTCCAGCTT CAGAGGGGCT CAGATTGTCA 
131301 GTCGCTGCOC GGAGGTCTAC TAGCAAOCTC CAGCTPGGGT AGGTOOGTAG 
131351 G IGTG CTAGT AGGGGCTAGG AAAAAGAGAA CAGATGAGGA CAGTCATGCA 
131401 TACAGCTCCT TTAICIGGIC ICIOCI IUT CGCTAGCCTC ACTOCTATTC 
131451 GAGGGCAGOC TCCAGGCACA Gl ICIGI ICA CAGGCTCCTG CPGCGCGCGT 
131501 GGGGGATGGT TCCAGGAOGG CTCCAICI IC IGIGU I IGG GCAOVTTAAC 
131551 CTOOGCAGC GTCACTATCT TCATCAGCAA AATCGAGATA ACATTAGTAC 
131601 CAOCTCATAA AGTTCTTATC AGGATCAOCA GTGAGATAAT CAATCTAAAG 
131651 TCCTTAGAAC AATCCTTGGC ACTTOGTAAA CACTAAATAA ATCATAGTTC 
131701 CTATTATATC CATACTTTTA AAAAAOCTCA TOCTTTTAAA AAI 1 1 1 I ICI 
131751 GCTCACTAGT GA A I I G I I CA CM I I IG I l b I I GI IGI I GI IGI IGI IGI I 
131801 GTTTCAGMOG GAGTCPOGCT CTGTQGGGCA GGCTOGAGPG CACTQGCATC 
131851 ATCTOGGCTC ACTOCAAGCT CCACCCOGGG GATTCACGQC Al ICICCIGC 
131901 CTTAGCCTOC CGACTAOCVG GGATTAGAGG GGCOGGOCAC CAOGGGOGGC 
131951 TA A I I 1 1 I I I GTAI 1 1 1 IAG TAAAGAOGGG GTTTCAC3CTT GTTAGCCAGG 
132001 ATOGTOtGA TCTOGOGAOC TCATCATCTC GCXACjCPOGG (jCTOTAAAG 
132051 TGCTOGGGTT agaggcatga ggcaoggtoc cggggcatca ATGGTTCAGT 
132101 TTTAACAGGT ICIGTGOCTT AAAAAAGTTA TTAAATTCAC IGI I ICLICJC 
132151 1 1 I 1 1 IGIA C CCATCATACT TTCAATATAT AACTAQGGCA GCATATAAAC 
132201 ACTTCACTTC CACTTATTTA TTTAAATCTC CAICI I ICXA TAACCCAAGG 
132251 GTAGGAACGA AATCTTATTC ATTGTTGAAA GCTCTAGCAC ATAGCACAGT 
132301 GGCTACCAAA TAGTAGGCAA AATCAGGTCT TCAATTCTAT TAACTACTCT 
132351 AACACTCAAC TCAAAGIGI I CAATOGCTCA AAATAATATA ATAAGCCTTA 
132401 ACIC TO GGGT GCTTAAATTT ATCTATAATC TCTOGCAGrc AAGTATACAT 
132451 A G 1 1 I IA AAG GTTAAAAAAA AATCAAGTOT TTAATC^ATT TCAGCTCATT 
132501 GAGCACK5AC CAGATACAGG ATOCTTAAAC TTCATAATAT CTAGTGCAAA 
132551 GATCAATTTT M il l PG GTA CAGATTCTCA CTTCAAGGGA TTTCCAGGCT 
132601 GGGAjGGGAAA TTAAATATAT AAAAAGTCAT T7TOGGCCAG GGGCTCTOGC 
132651 TCATOCCTCT AATCCCAGCA CTTPGGGAGG GOGGGCAGGT GGATCACT7G 
132701 AGGTOVGGAG TTCAAGAOCA GCCTGCOCAA CATGGGGAAG OCXXATCTCT 
132751 ACTAAAAATA CAAAAATTAG CTGGACTTPG TGOTGCTTCC CTCTAGTCOC 
132801 AGCTACTCAG GAGGCTTAGG CAGAAGAATT GCTTGAAjOCr GGGAAGGGGA 
132851 GCTTCTAATC AGCTGAGATC ACACCACTGC ACPCCAGCCT GGGCAACAGA 
132901 GACAQVCTCC AGGTCAAAAA CAATAOOTTT TTTTAATOT GTCCCTTAAT 
132951 GGAAATATTG AGAAAATCTC TAGGGGAACT GGAAGGAGAT GATTATAGGA 
133001 GTTGATTATC TATCTAAAAT CAAAGTGAAT GAGGCAGTGG CAGGGGGGAA 
133051 AGGGGGAACC AGTAATGACT TAGAAGTCCT AAGGATCTTG CATGGTAATT 

133101 GTCAOVnTG CI ICjLIGGGA ggggagctga cu pgtggtg toggtcctag 

133151 GTACTCTOCT 1 1 MCI I CA T TGGTCTCATC TGCATGATCA TTGCTCTGTG 
133201 TAGCAICI I I GGGGGGCTCA I IGC I GIG I G GCTCATCATG GGTCTCTTQG 
133251 ATOGATOCTT CATTTOCATT ATGGCTQGCA TAGCCTTTGA GTVAGWCGT 
133301 GCCCAGGA7G TCTOGCAAGC AATTOGATTT CTGCTOGGAT TCAIGICIAT 
133351 AOXATOVCr GTTGGGGCAC CCATTOCAGG TAAATATAAT GATTCTCCAG 
133401 TAGTTATATT AATTCATAGT Al I \ I C I A CT TCAGGPCTTA ATTAAGTOC 
133451 ATTTATATCT AAAACATATT AC AGGTTA TT GATTACTGGT CI 1 1 IGU 1 1 
133501 TATGTGTGCC TTACTOGTAA CAI 1 1 I IAAT AAGACTAGCT ATTAAAGAGT 
133551 ATC1 RSAATT TCCTGAAGAG TCCCTCTTAT GCTTACTOGC AGCTAAAACA 
133601 CTATATAAAA CACTTTOGGA GGCTGAGGOG GGAAGATCAC CTGAGCTCAG 
133651 GAGCTTCAGA CCAGGCTGGA GAACATAGTG AGACACTATC TCTACAAAAA 
133701 ATAAAAAATA AAAAAATTAG GCAGGCGTGG TGGCATOCAT CTGCAGTOQC 
133751 AGATGCACAG GAGGCTGAGG TGGGGGGATT GCTTGAGGGC A&GGTCAAG 
133801 GCTOCOGTGA GCTGTGATCA TAOCACTCTA CCCCAGCCTA GGCGAAAOOC 
133851 TATCTTTAAA AAAAAAAAAA GTTGAAGATT CTGAAAGATT TOAAATATG 
133901 TOQG LMMG GTAAGAGATC ATACATOGGC CTGAGGAGTA GTTGATGAAA 
133951 AATAACATCA TAGAAAAAAC CCAAATOCCT GTAAATAGCT AACTGGGTAA 
134001 GCAAAATCTG GAATATCTAC AGTGCAATAT TATTCAGOCA TAAAAAGGAG 
134051 TAAGGTACAG AACTGATGCT AGCAGGTGAA 7GGAGCTCAA GAACATTATG 
134101 CTAAGTGAAA GAAGOCAAAC ATAAAAGGTG ACGTGGCTGG GCATCGTGGC 
134151 TCACAC3CTGT AATCCCAGCA CTTPGAGAGG GCAAGGCAGG AGAATCATTT 
134201 GAGCTCAGGA GTTOCAGACX AGCTTOGGTA ACATAGCAAG ACL I IGIUL 
134251 TACTAAAAAT AAAAAAAATT AGOCTGGGAT GGTGACACAT ACTAATATTA 
134301 GTCTAGTGGC AOCTACTTXA GAGGGGGAGG OGAGAGGATC TATTGGGCCT 
134351 GGGAGATCAA GGCTGCAGAG COGTGATAGC AOCACPGCAC TCCAGGCTGG 
134401 GTGAGVGAGT GAGAGGCTOT CTCAAAATAA AAGGTCATAT ATGGTATCAT 
134451 TTATTTCATC TCAAATATCC AAAATAGGTA AATTCATACA GATAGAAAAC 
134501 TGGTGACTGC CAGGGGCTGG AGAGAAAGGG ATTCGGAAGT AACTACTTAA 
134551 TTGGAATCAA GTCTOCTTn GGCATGATCA AAATAI 1 1 IG GAACTAAATA 
134601 GAAGGAGTGG GTGCAAAACA TTCTCTATGT GCTAAATGGC ACTCAATTCT 
134651 TCACTTTCAA AR3GTTAATT CTATTTCATC TCAATTCCAC CTCTAGAAAT 
134701 TCTCTGAATC ATATTATAAA GATCTCXTAT GTTCAGTATT ACTTOGGTGC 
134751 CTCTGPGAAT TAGTATTAGA AGATTCAOTA TAAATATGAT AAAATTTAAA 
134801 TCTGTGTACT GTCATCTGTG TT7GGCACAT GAGGTGCPCA GTGAAGGTTT 
134851 AGCPCAATTC ATTCTCATAC GCTGAACTGT ATTCCTAGTC ATGAATCAAG 
134901 CAAGCAGAGT TACTACTGAG CAGACCAAAT GCACTTTAAG TTCAAAAGGT 
134951 CATCGTCTGC TTATAGGAGA AGGCPCCATT GCTTCTTOC TGCGGCAGGT 
135001 AATATAGTAT TCAC1CI I C I TTCTATAGGC TACAAGCTTA GTTTTATAGT 
135051 TTCTAAGTOC ACAGCAGATT TTATATAAGC AGCACTCTAG TAGTAAATAG 
135101 1 1 I raCTAQC AGAAGTGACA GTTAACAGAG CAAGAATAAG GGAGGTAACA 
135151 GGAAACTTCT AAAGCAGGCA GAACACCACC ACACAGCAAC TGGCAGCAGA 
135201 AAGCIGCPGG CTCACCAGGA GGCTCTGAGG GCATCATAGT ATTACAGCAG 
135251 AACATAATCA TCA GTCTTCA TAAGGAGGAA GAGTTCAGGA ATGGTCTGAT 
135301 TGCAAAGATA GAAAACTTOT GCATTTACAG ATA I IGICIA TTTTATATAT 
135351 GCAGAA7GGA TCTACACATT TTTATCTUTT GGTACTTAAT TGGAGAAATG 
135401 GTCAGCTATT TATTTAACAA TCTCAGATAG ATATTTAGTA CAGTATATTC 
135451 GCTOG^CATT TTAGATAACT TAG G1 1 1 I A T AGTATAAGTT ATAAGAGTTT 
135501 AAAAACACTA AGATAACTTT TAAAACGA7G AGTCCTGGAA 1 I IGI IAGAG 
135551 AAATTAGAAA TCTTCAGTAC CGTAAAAGTT TTCAGAAGCA GAAGCOGAAT 
135601 ATAGAATCGC ATTAATAATT CTTATATACT TACATTTTCT TGCAGGTACT 
135651 TATAAAGCTT AGTCTATTTC CAAAATCTAT TTAAGAAOCT AACTTTCATC 
135701 CAAATTATTA AATTTAAAAT 1 1 I ICI IAAT TTTGAATTAT GATCTTAAAT 
135751 AGTTTTCATG TTATTTCATA AAAAACAAAT ACAOCAAAGA GTAGTTTGAA 
135801 AGCTOGGCAT GGTOGCTCAT GCCTCTAATC TCAGCATTTT GGGAGGCTGA 
135851 GGTGGGAGGA TTOCTTTAGC CTAGGAGCTT GAGAGCAGGC TGGGTAATCT 
135901 GACAAAADOC C A P CTCTA GA AAAAATATAA AAATTAGCCA GGGATGGTAG 
135951 CATGCAOCTG TAGTOOCAGC TACTCAAGAG GCTGAGCTGG GAGGATCAAT 
136001 TCAGCTPGGG AAGTCAAGGC CTCCATGAGC CATGATTATC CTAOGQCACT 
136051 CCAfiOCTOQG T&CACAGTG AGAGOCTCTC TCAAAAAAAA AAAAACGTCT 
136101 ACCTTTCTAT CI 1 1 1CA AGT AATCTCTTOG TATCAAAATC CAGAGGACTC 
136151 TAOTTATCA CAAOGTTAAA GATATCAGAT C I I M PCI PC CTATCAAAAA 
136201 AQGTTTATAT TTCAGATCTG TTTCCTAAAA AAAAAAAAAA GTCTGATGTC 
136251 TCAGATCTCT GAATCAGTCC TCTOGCTGAT CTAGACCGCC TACAGAGGCA 
136301 CACI TCI I C I GGCTTCGTCA CAG C I 1 1 1 I C C CI 1C I CA GG GTTACTTOGT 
136351 GACAAACTOG GCTCCTATCA TCTQQCATTC TAOCTCGCTG GAGTCCCTOC 
136401 OCTTATTCGA GGTGCTGTGC TTTGTTTTAT GGGGTOGATC CATAGTAAGA 
136451 AGCAAAGAGA GATCAGTAAA ACCACTOGAA AAGAAAAGAT GGAGAAAATC 
136501 TPQGAAAACC AGAACTCPCT GCTUTCAAGT TCATCTOGAA TCTPCAAGAA 
136551 AGAATCTCAC TCTATTATTT AATATCTTAC ATAGCTOCAC CAGACTOGAC 
136601 rTOCTTTTTC AATTTTAAGC AAGTTTOCTT TCU I 1 1 ATA CAAATTCCAA 
136651 ATTTCATATT I 1 1 1 lAATtA CATOCTAQGA ATAQCACAAT AATTOGGAAA 
136701 TAGAACGCTT ATCACTAGAA GAACCATTTT CTCCCACTAA ATATCTCTGA 
136751 TCTTTOCATG AGTCTCAGGG CAGAGACTCT GCTATATGAA AACATCTCTC 
136801 AAAOTCACAT ATTCTCAAAA TTTCAAGCTA TCTCAGTAAA AAGCAGCTTT 
136851 GGAAACTGTG AATCAICI 1 1 AGCI IGIACA AATCTTTAAA AATAOCTCAG 
136901 GCTATACnSA AAQQOTTQCA GTTTGGTTAG GAGTOGAAAT ATTTTCTTTC 
136951 TTAATCATCT CTTXAGI ICI GGTAOCTCTG TTTTACTTTC TTATOUCI I 
137001 TCG4AACTTT TTOCAAAATT TAAGCCTGGG TTCTAGATAA TACCAGATCT 
137051 AGCTAAAGCT CAAGTCTATG TTAAAGTTCC 1 1 ICCIGUG TTAAATAAGC 
137101 TATGATATTA AGATATTCTG AC I PGCTOCA GTCTCAAGGG AGU ICIGGG 
137151 AGCAGCTGCT AACATAGTOT TCAGAATCAA TATGTGAGAT GAAAAGGATC 
137201 CCCTOCAGGA GGATCCTGAG CTGTTCAGAA ATCATTTAAG TTTACAGGGT 
137251 TCTTOOCTTT GOOTTTGCAG TCOGI 1 1 1 AC TCAAGTAGCC AGAAACAOGC 
137301 CA OjI I ICIG AATTTCTTTA AACTCTAACA ATAAAOTAAA ATAGAATCCA 
137351 TCAAAGATAT TCTOGOGATT GTAACTTAGA Al I I I ICIGA CI ICIGGATT 
137401 TCTTOGCACT AGAAGCTGAT ATTTAAACAA AGICI IACTG AGCAGCTATC 
137451 AAGTOGCAGT TACAGGCACA AATTGGTGGA GGCTGGAGGA TGGGGAGGGG 
137501 AGCAAAAGGC TTTATATTTG TCAAGAAAAT ATCTOTAGCT GATAGAAATA 
137551 ATTOCTTAAA TTOGTTTATC AAATTAATCA GTCTGAAAAG GTTAAAAGCA 
137601 CTTATAAAAA GAACCAAGTC CTACATTTCC AGAACI I ICI GGCAAAAATT 
137651 TOCACTCATA TTATTTATCC TATCAACATT CCCATTCTTT TTTTTTCCTA 
137701 TTTATATACA GATTATCATA AGAAAGCFCT CAGTTTGAGG ACOCAAAATA 
137751 AAAC3CAAAGT CATGGCATCA OGCATACTCA TTTACAAAAA CAAGAACACT 
137801 TTC C TCT A TC OCTAAAATTA TCCTTTAGTA CTPGAGGCCT TTAAAAGTTA 
137851 GTGCI 1 1 IGA I IGIG4AGAC ATTCAGCAAC TTACI I IGIC ATACATGCAG 
137901 TTOCACGTA CCA CI ICIA A TAGTGTCATA TTTCATATTC AGGGGACTTA 
137951 GATAATTTGC CTCTGGATOG I ICI I 1 1 CCA GG AA AAA A AA TCTACATTTT 
138001 GAOCATACTA (JUL 1 1 I CATC I ICI IATTAT AAGCI 1 1 1 AG AAAATCATTT 
138051 GATTCAOTGA 7GTOCAGTTA TATAAAAOGT TACI I ICICA TTTTTCAGAA 
138101 GTTCAACAAA AGATACTACT AAGAGCAATC ATCAAAGOCA CTATTATAAA 
138151 TCTTAATTTT GGCTOQGTAA GGTOGCTPGC GCCTATAATC GGAGCACTTT 
138201 GGGAGGCTGA GGAGGGAAGA I IGCI IGAGC CCAGGAGTTT GAGAOGAGOC 
138251 TCGGGAACAT AGCAAGATCC IGICICIACT AAAAATAAAA AAAAAAATTA 
138301 GGCCAAQCAT AGTGGCTCAT GCCTOTAATC CTAGCACTTT GGTAGTCCAA 
138351 GGGAGGGGGA TCACTTGAGC GCAGAAGTTC AAGAOCAGCT TCGGTAACAT 
138401 AATCAGAGGC TCTGTCTACA AAAAATTTAA AAATTAGOCA GGCATGATGG 
138451 7GCGCACCTG TAGGOOCAGC TACTCAGAGG CTGAGGTAGT GGAAGGATTC 
138501 CTTCAGGCTA AGAGATOGAG GCTGCAGTCA GCTATCGCAC TCTACTCTAG 
138551 GCrcmCAAC TGAGCAGAAC OCTCTCTOTA AAAGAAAATC AAAAACAAAA 
138601 AATAAATCTT AAA 1 11 IGI I TTAAGI 1 1 I A GCACAGACTC OOCPCAAAAC 
138651 A0C1 I C I C GC CAATTTTACA GAAAtJTAATT CAAAAATCAA AACTTTACTC 
138701 TCTAAAGAGC TOAGAGTCT 1 1 I ICI 1 I IC AAAATTTGGC TGATTTTAGG 
138751 AAAAAAGTGA TCATCTGAAA CTAAAAGAAA TTOCTTOGTT AGTTTCCATA 
138801 TTAAAACAGC AGTGAGAAGT ATATATAACT TAGATCTOVG CATATCTCTT 
138851 TCTATATTAA ACTPCACATA TCTAGI 1 1 IC AGTTTAATOG AATCAATCAA 
138901 ACTGGATCTA TAACACPGAA AAAGTTCTAT TCTAATAGAC TCATAGGGAG 
138951 AATACTCTOC TATAATAATA TAAAATTAAG AAGAAAAAGT ATAAAOGTAA 
139001 GATCCTAAAT TCCATAAATC CATATTTAGT ACTAIGI 1 1 1 TTGTGGGAAA 
139051 AGTTCTAAAA GTTTAAAATC CACAAAGAAA ATG4AAAATA CTAATATAAA 
139101 AATTTGTGCT TTAATCTAGT CAAACTAAAT CU I ICIAAT TTCTGAATCA 
139151 AGTTJTTACrc CTGCAATAAA GTCACCTCAT AAGCCTAAAT Mil IOICI I 
139201 CAATOCAGAC AC 1 1 I "ICICA GAGTCPGAAA AGAATACAGA GTCAGAACTC 
139251 TGTTTTTATC TCCTCATCCT GTTTTTGATA AGACTOVGAA AATTCPCAAA 
139301 TTOGAAAGGT TCTOGCATTT GAGGGCAAAA AAAGCATCAA AGGGAGTAAC 

139351 ATrocrrm atagatactc tagattogat actattctaa cagatcgcca 
139401 AGAAACTTOC AGAAACATTT TOGTTAAATT TTATTCCAAT QGATATTOCT 
139451 GGGATCCATC CATTTAAGCA GTAATATAOC ACCOVGATTA TTGATACTTT 
139501 ATQCAAGATC IU ICATCTC TTTCCTCATA TTTACAATQC TTACTOCATA 
139551 GCCCTGCTAC AAGACTTAAA ATT 
(SEQ ID NO: 3) 
Context: 

DNA 

Position 

352 G^TGGTTAGCQVGGGATTATG 

CATOVCATv^TATt*^^ I I I I ICC 

AC0CTAAAGTT A 1 1 I U l O CCQO^CTDOCiO C I i TCCATCCTATAC 1 L 1 1 IGGAAGAAAG 
TTACTAOj<^TAJG0^^ 
AOT A I G I i Q XTAAACTATTTAC AI 1 1 UL I G CAGGAGAG^TTTXaTCr A TTCrCTCGrC A TT 

[A,G] 

TTTATTTAATCATTTACT^^ 

Gl I l(j I (3 1 CO VroCAAATATATAATOG A KjTCCI I IC I I I iX XX AAT AATATOT TCPGGA 

CAACTnOVXACTCAATAAAT^^ 

TATCTAATCAGrCrrO^^ 

TACTGQGGTATCA I C I I CATACATACA I I n IG rGCACTTATCTGGCTA I ICU I FIJI 1 1 



381 CAOZAAGfiC&CAGMtt 

TCTAGTTCAGGTAC 1 GTGTOOCAT ! 1 1 I IGCAGGCTAAAGTTAI I ICI IGGCGCCACTOC 

GOCTTTCCATCCTAT^^ 

AAGCATTCTACTnCAlOCrGCA^ 

GGAGGACaAGATTTTTTCTATTCTO 

[C,T]

TCACTOGTGGATA A I ICI I A CAT AIG 1 G I I I GI I IGIGIG CA7GCAAATATATAATO»AT 
G1GU I ICI I IG COCAATAATAIG1 I G I GGACAAGTrTT^AAGTCAATAAATA^GATGA 
CCITO^GAACTTTT AGAGGTTTT AAACTAAGTATCTAATCAG I C I I C I AOCAATCTACAT 
TATACrra^W\TTTTrXTTAT^ 

» 1 1 G I G CACrTATCTCCCT AI ICC 1 1 ICI I I'ACT ATTTTAlGQCPCA ( I IC I A AGGCAGAT 

3505 GGCTGGTTAGGCAAAAAACAAATCT^ 
TCACinrrATCeTGCAAGGTCAGC^ 

GTGAAA G 1 G1C I I GG I G GGTAGT AGAATGGCAATAAAACAC ATATO VACreA CrTC^ AQ^ 
GCTAACTGATTTCCATTACTAAATCAAGCT 

GATTAATA G I C I 1 1 1 1 I i I IAAGCTTGAI 1 1 IC1A TT AI I I I I A GAGTGAAT A I I I CI IA 
[G,A]

GTXTTrAOTATGCATATGAGGAATOG^ 
GCTGGGrrATGOGAGAGTTTTAAA^ 

TCTGCAAGTTACI 1 1 ICI l(XATTOOGCATCAIGI 1 1 G I G ACrrATGCAGATAATACCTC 
AGTGTTjAACTGATAACTCAA A I I CI I ICC ATTTTAACITAG G I GG I I I G CATTGTTTGAC 
TATATTATACTCTATOCATRnO^ 

10280 TOQGAGGG^GOTAATTACCTOCjTGG^ 1 1 1 ICCIGIG 

C I G I I C I I G I GA TAGTTAATAACjTCr^TCAGATCrGATGGT 

CTGCACA CI I I C I GI I G O L I G I lG C3CAltnAAGAOVTO0 C I I I GUCCICI I ICACCTTC 
OVDCATWTTXjTCAGGCOCCCrAGQCATXnXaGA I C I 1 1 1 1 

CTTTATAAATTAjGOCAGCCrCAGATATT^ 

[G,C,T] 

TTXATTACCATAAAGAAAAGGCTTTT^ 
GTOVrTACOXXATGGATAGAT^^ 

AGTATTTA C I 1 1 1 A TTAACTTGAACTTAAGAGCA GC I I I I IGGCTGGAAAAAAGTTCTAC 
TTATCTCAAAGI I G I COIWVAGTAGAATCCTAC ITXTGTC^^ 
CTACATATTTACTTGCATCTTTrr^ 

11107 GGAAACTCCAGCACACCTG^

GTOGCTGGAGGAATGGAGGAAAGGft GTGCI 1 1 CA MTCTAiCATGCCAAATTrrGrATCATC 
ATAO CIC I I I GI IAAAGTCOLI IC I 1 1 A ACAGCAAAA GTAATTO CniACCrrGC ATATAG 
G^AC rA AA AAA AAGTOCaATCAAGAAATGGCrTGC^ I C I II I I 

TCATTTCACTAATTTTTAATTATGGGCATAAOT 

£ G '*I 

TOGACCAAAGGAATC1 1 HOG I rGTO GAOCAAAGCAGTGA ATCI 1 1 1 1 ICI I I C I I ICI I 

I C I I MC I I M 1 1 1 I GI l A AGftGAOVGGU ICI I GCICIC I IG C ICA GACTGGAGTGGAGT 

OVGT^lXjTXjATGGCnONOCATAAOCTCAAO C PC I I G GGCTCAAGQGAT OCTGCTG OGOC 

AACXITCCTGAACAQCTOGGATTACAGG 

I I 1 1 I 1 1 G I G GGGGAGGGT X ll'CTOATATTOGQCAAGL I GG 1 1 1 C AAATCGCTGAGCTCA 

15750 TGGCTOVCTGGAGOCTajAOCTCC^ 

TAgg gg^ 1 1 I 1GIAI 1 1 1 1 1 jGI AGAG 

AGCTOVGGCTCTGAAAGTGCTTjGCATTACA 

TTTTCAATTCTAAAAAAGTAAOCTGCroCCTACTGAA AAATAGAGTTAAAAAAAGT AA 

[T,C]

OGGTACAGAOVGCRrrATTTT^^ 
AATACATTAACATTTOPGCAGCAAAA 

TAQCTTCAGGTOVGTTOVGATCAAG1 1 1 IGLI ICIAATTAAAIGI IGATAI I CICI fACA 
AAAACTTTOGGTTTCGGTTTTCAGATTT^^ 

TGAGACAGA GIC I I GCIG I GI I GL I UVGGCTOGAGTGOCATCGCACGATCAjOGGCTCACT 
16004 AAAGTAAOCTGCTOCCTACnGAAGr ^AATAjWGTrAAAAAAAGTAATCTXXjTACAGACAC
OCTATTTTCTGACAGCCCT^ 
TTOXX^VGCAAAATCTATC^ 

GTTCAGATCAAG1 1 1 1GU I C I AATTAAATCTPGATA I I CICI I ACAAAAACTTTCGGTT 

TCGGTTTTCAGATTTGCAAAT^ 

[T,A]

GcrcrcrnoaovoGcraG^ 

AGQCTGAAGOCATOGRXCACTTCAGCCKXX^^ 

ACATCrOCAQCTCAl I I I PGTA1 I CI I A GTAGAGAGAGGGI I I CXCCXjTCTTGGOCAGQC 

TX3GTorAAACTarnoc7rcro\A^ 

TATAGGTUTOVQCXACTOGGOCTCGC^ I I PQC1 1 1 1 



16871 QOCTATOVVCAACTTTATAACACr^ 

MTOVOTTATrxnOCX^a^^ ICjCIGAAAACQCrAOCAAlGl I IGG 

TTTTATTTTTATTACTATTTCCTAATT^ 

TTAAAATCATATTTOVb I GC I rATAAGTCRHCI ICGCAGI I I I IGAGCCCTTTAGAAGC 

TCCAAATCAOCTOGCAATTATATATO 

[A,G] 

TATATTAG^AAAACrrCATrmXTAGOT 

CCTTAl^TTCTOVCCTCACACA I C I IGGCAGGTAGAGCAIGICCI IGI I ICCICIGCIGI 
CrrAGCCCAGGTGTTGATCAAG Cj I C I G I C I I A GGGGGGGGGATAGGAATOGAAATAAAOC 
ATCTAGAGACTTOGGOVntjAGGAC I I IGIGAI ICI ICXAGGTCACATCKIATCCrTXZAGA 
G^TO^GTCroCAAGAGTAGOC^TATCrrAA ICICI 1 I CAGTCCTATCAGCTTCCATCA 

17163 GCD^GTCAATATATTAGAGTA* 
TATCTTAGGCCTTAGATTC^^ 

CTCTQCTSTO IAQ00CAG GIGI PGATCAAGGTCPCTC1 IA GQGQGGGGGATAGGAATCG 
AMTAAACCATUrAGAGACTTGGGCATGAQGAC 1 1 IGIGAI ICI I OCAGCTGACATCTCA 
TOCrTONGAQGATt^VGT^ 

TTXXATCAACCTCTGGfO^ 

GCTATCTTAAAA A I ITTCn I GCATGTATTTAT7TAGTTA I 1 1 I IATCAGAGAGGGTCTC 

QCnaUTO\OC3CATCCTOC^^ 

CrRDQCAGGCTCAAGTCATX^^ 

ATCTCACCAGATTTCGCTAA1 1 1 ICATAI 1 1 1 1 rXTTAGAGAGGOGCal I ICJbCCAOVTTQC 



17966 TCAAGO^TTCrCCTGTOCAGCCT 

AOCTOGCTAA1 1 1 1 IGIA I 1 1 1 rA CTAGAAATOQQGI I nCACSCATCTTOGTCAGGTOGGr 

CTOWVCTATTC^OCTOVQC^^ 

GATGTGAGCCAOCATXrrCCAGCO\OOCA^ 

AGCCAOGCAAGAAAOWVTTOGA^ 

[A,G] 

MTIOTAGATAAGTAACTCTGA^^ 
ATATAGGATTTTAGAGGAAGGGA^^ 

GTTT1 I G AGTTAG G I G i l A CGAGATTGGGAGA L I I l " l C I CA GCATATCTAACAGAAGAGG 
GTATCGGAGCnGAGAGPCnAAGGOCnSGGCAAGGGI I GGGAGGCAGTrCTAATACTGAAT 
G1 nCRa A CTGrSGI I I ACT ATGTATTTCAGGTTA I ITTCn I AATCTATCGAGTAATCCT 

19392 GAACATGACATAATGGAAAOCCAGAAATCC POTG IGAGGGGACTCAGGGGCIGCI ILIA 
GOTAATTTAGTPC AI I I C I A CTGAAATCATTATATTTAAAAGTATOGGGGCACrGAGATC 
GCCACTGTAGCTXjCTOCCACOOTAGLI I IGG I C I IAAAAAAAAAAAAAAGTAATAGAA 
CI I (XI rA AA ATCTCI I PCTA G OG I PQGA1 1 rCTCTAATTCAGAI I PGOGTCI PD 0CAA 
GGGTCAAAATTATA I 1 1 1 IA CTATOO LIG I C I I A GOTATTTCCAAAA CI ICGICI I A AGA 

[C,G] 

TTAGTCAI M ill ICLI I CA ATTTCACATCAOT3CTAAAGA C I I I PQGCATG1 P0CTCCT 
CCI \ 1CAI 1 1 G I GATCTAATTAAG I I G GICIGIAAGTCrTATTTTT AAG AI G I I C I A GMI 
CAAGAGACTCTGAGAATAGCnAGAGTCATT^ 

GTATTCACAGTGGTGAAAAOLIC.I I CAACAAGCAGATGATTnTATCTTATATATTCACAG 
AGrrTAGTAAOXyWQOCAACTAC^ 

20113 TTCAGAAATATOGACTCTTTT^^ 

GAGGTCTC AICI I IA TGGCA CI 1 1 C I G GCTGGGAAGGGAGTCAGTCCTAATTCAGATAAT 
AACTAG0CACOGG0CACACACAA G I GIGI I I I C<3CITAGTTACXrTTnTIACACACTGAQC 
AGTGAGAOTCAAGAGAGTCTCAAAOT^ 

ACACTGI IG l CjA GTGAGO^GGTPGGCAOGCjTG^I G I Gro OGGGOblGIGIG^ 1GAGPCA 

D\C] 

GIGCIGIXZCTCT GA TCTCT CA GGACTCAGGTOCTCA A I I GC I CIGI IG I G ACTCAAGOOC 
AGCTGAAGGTGCTOGAAGTGC^^ 
ATCAAAGGGAATTGATAG 1 1 G I GTAAGAATAGATATCTGCCG 1 1 1 1 1 nCTTAAGGCAAGAC
AQCTTTACOCrPCCACTAA I 10 1 I I LA I L1 1 1 1 MTATCATTTX5GCTTCATTTADVCj^AT 
CTTnATTAAAGCAAAAGTOV^ 

AQGACTO^GGTOCnSft A T TCC T CIG I I G l W CTGAAGCCCAGCTGAAGGTGCreGAAGTC 
OVCTGAGQCTOGAGGAAGAAlQGAGT AAQVGCAGAQGGTOGAnn SAAAG GGAATTCATAGTT 
GTCT AAGAATAGATATCTCCLG I 11 11 i G 1 A AGCXIAAGACACjCrrTAOGCTTCCAGrAAT 
RSI 1 1 LA I L I 1 1 TA ATATOVTTTOGCTrOVTTTAGA^ 

GCATGTAAGCTQGTATTTTCAOCACA 1 1 IGILIGLIGI IGIGCLI PCTOGCTGAACTGAA 
[G,A]

TACTGGCTT ACTCACTAGTAAAT A I G 1 1 1 1 L 1 (V VCAATTATAGGGAAGGGAAGAAAAGGA 
AAGTGCAATTAAAGCATTTIT^^ 

AAAGAACAATCrAAGTTTTTWATP^ 

CAI I I G I I A OCnOTATCOTATTATAATTA A I 1 1 1 G i A TATCTOGTAGTT A I 1 11 I I GIA C 

TAAATTAATCAGOGG I 'CACTA A I C i I I GLA TAATCACTCrATreAGCTOGMCr ATOCTT 

AGTATTTTCGAAAGCAAGTCAGTTjAGT^ 

TTAATX^TAAGTT^VATAGAAACTAGTTCGCr^ 

GATCATTCrGTAAGTCrTOO^^ 

Gill IGIGCTAGAI 1 1 1 LAGATAGGCTTTAGAGAAAGTTCAGAGCL 1 1 IULIL1AGCTOG 

[T,G]

GATTAACA^A/GCTTXXrrrX!ATACnTAAA 1 G 1 \ IGCACaTTX^TCTA TCCAl I I 1LACTTAC 
TAGAATTAGGTAAGTTAG I G 1 IATAAAAI RGG1 I IGAGTtjPGG ATTCTT TAGGAAgnGAG 
IL I 1 1 IGGIGQCAGCAAI I I I G I I A TGCATTAAATAGATACATA 1 I I rGAACTAGTOGAC 
ATTCrrTOVGTXTRnATTTATTAGATCC^ 
G GbG I LI 1 GGAAAAGTTCATCGAAAAMTCT^^ 

I ILI I I LG rAATOTAAAAAATATTTAAAGGGCAGCT A 1 1 1 1 111 I LA CTTAATAATCTA 

AAAAGGACTGCATTCACATGAT^^ 

ATCGCTCXHATOVVCTCACAAA^^ 

AGTCTAT A i 1 1 I LA TT A I l l I I IAATTTCA1 I II I IAGTGAAI 1111 IGAOSTO0CCTTC 
TATACATirTAATOCTAAOGGAATAAA^ 



TAGGTACAGTCTCTCAAACACAGAG^^ 
AQGAGGnOOWVGAGAATAAATCAGATTCCTATCC^ 

CTT ATTTATO\TAATTAAAATAATGGCA 1 1 1 I CATAACA^ACTTCACL I GL I IATCTACA 
TTCTAAAI RSI IOCJLI I lATAAGCntXXjGGAGATAAAGTAATTCAG^ 
(XAATCATAATTTTCTGTOVTTAT^ 

TTTAGAGGATTCGATCAATAG 1 GGI G GT S OO^TAAAG^AATTTAAATATOQCTX^TA^ 

TttTATATTTAAGAAAGAO^WVG^^ 

TC^CTTCAGQCAGCAOGCTTTO 

TCTtTO^GCAGTT^TCG^ 



TTTTTTTTTTTTTOW^TAGG^ 
G*QGGAGG*GGTCyV\GTTAC^^ 



QVGAGAA&VAAGT AAGAATGGGAGCAAATCTAGGTAGG I I PQGTCAGGATCAGGAAATOG 
AGGGGAAGAGGTCATTAAATGTGGT^^ 

GCAAQGTXnTTTraGATCACATCGAAA 

I 1 1 1 1 ICjI HG1 RG1 1 nSI rrcm IG G^GTTTTAACTOCAAGCTAAAACraGGGrrnC 
ATTTCAGGAAO^VGTAATAGfrAA Al 1 111 I A TGTAGATTCAGCAAAATCrAGTAGrGAGr 
GGTTAO 1 rOGa 1 1 1 C ATTCTGQGG AT 1 GTGIGIG1 G 1 G AGTACATCCACGCA L 1 1 G I G 
TCnTAAGOGTGTAAGGGVGACAGACAGTGGCnACAGG I I 1 1 IGAAATOGALI ILI IGGC 

[A,n 

AAAGTAATAGOGAAAAAGAGGAATACAAATA^ 

ACAa^VAACAGTCAATCOJOGCK^ I ICI I nGTOOCTCCAGTCl IG 

I IGLL I G I LI I A TCATCATC L I GG I 1 1 I CA GC3CAAC3CnGAGTGAGrAAAAGC3QGGGrcr 
(^GGTCTOVGTGCOGCGTOGCTl^TATC 

O^GGAAGAGGAGAGATTTOGAGGAAGAGl IOL I L I 1G I UM I I I I A TTCGPGAGTCATCA 
ACTGCAACCTanTAAAGCTACATW 

CTGTATCTAGTGGAT70A I 1 1 1 1 1 1 1 AAATAGT ATTTAATTGTA I G I ILI ILIACACTT 
TTTCCT A TGTOCATOTAO^TAGCTGTGG L 1 111 1 1 CG I GGAAGTGAGGACAGATTCCT 
TTO^CATCrcCAl 1 1 I I G I G I L I GAATTAAAAGATOGACAAGTATCATCTATTATCrTA 
GTAGTCATCAAACAAGGAAAAAGGI I ICI I IGI I RQC1 nSTTTTm IAGATGAAGTCTC

[T,C]

QCGOVQQCraj^QOGCAGroGCAC^ 
AAGOVSTPCTCTCOCTOVQCCrcCTXaACT 

GQCTAAI I 1 1 RSTA1 1 1 RGAGTAGAGAGAAGGl 1 1 I bC CAQOTTCGCCAQ X I GG T C I I G 

AAOXXnX^CCTOVGGTWTCO^^ 

IT^GGCACTCTOGOOGGCC^ 

31455 ACTTAA1 P0G1 IGIGGGCAGQCAG A I C I I I I A AAQGTAAATTTCA A I I RCTC1 I IA AGAA 
MTOGCAGAOVGAAGGATGGGGOVT^ 

AAAACTAAAACATTTTAAGCCraGAAl IGU I I AOCAGT ACA7CTATTCATTATTTAATT 
TTGKXTrrAGAAGWWGTTTSQ^ 

ATTCATAAAAGAAGnGGGAAGGTCAGGCCAGG I G I GG I GGGTCATCOCTGTAATCCGAGC 
[A,G] 

CI I nS QGAGGOOGAGCnira 

AanTtyVAAGCCCGTCTtT^ 

GCCTGTAGTCXDCAGCTACTTC^ 

QVQGTTGCAGnyVGCCCAG^TCAAGCCAC^ 

CiCGTiaUWWWWWWWVWW^ 

35653 CTAATAGAAAAAAATATATA I I 1 1 PGTCAAATA1 1 ICIGI 1 1 lATTCflTTCATTCAAAGT 
ATATTTOGAATGTTTATTTCCAGGW 
ACrO^CTTTAACTGAGOCA^ 

CCICI IACTCGUGI I GGAGGAATTCAGGCAC I C I GGTAAGOCTXnAGGCCTGGATTAGT 

aAGTTCATTOGCAGCAGCATTA^^ 

[T,C] 

QOTAATOOCQQCACI I IGGGGGGCTOVGTTCGGCCIGI IGU IGAGCCCAGGAGTTtAAG 

AOCAGGCTGGGCAACATAGGAAAC^^ 

TOin3QOCTXnX5CCTOTAGrAC^ 

OCCAGAAAG1TGAGGCTGTOG I CAQCTCTWTTATQCCACTQCA00C3CAQL I IGGGIGAT 
ACAGTGAGAGCCTCTC1 lA AACAAAGAAAAGAGATTCT A I PSTC1 1 1 IGAAAAACATAGT 

42700 AQL1 1 1 I C I CGIC I C I A aGAGGCGAAAAGGGG(3*GT^^ I 1 ICI ICI 1A 

AAACAGGGI I ICA I I ICC I PGG AA G1 I RSI I I C I I I G A A I U I ICIG I CA GnTAACTCT 
TATO^TDNATTGGTTAGCATTOAATAATA^ 

CTTAGGAAGAGGCAGTTCWAGCI II 1 1 1 A7CTCCATTAT rCTGCTAC 1 1 ICC I ICICAT 
TTTAQVGATCAGGAAAATGAGGO^CAGA G I GG I I A ATTA A I C I GI I PGAGgTOXOTAGC 

[A,G] 

GGTCAGTT^TCCCAGGGTTOV^^ I I ICI IAA 

TT A I I I C I I C ACAATCAT A I fa 1 1 1 A ATC ATTACITATTGATTATTTAG I'GGTCTGATAAG 

GGTTrcAAGAAAAAATATAGATGTTAATT^ 
GACX^GTTXjCAa^^lGlGCCACKX^ 

45516 GTTACTAGCT ATCTA A I C 1 1 G AGAAAACT AGTCA A TCTCTCTC TO CCTTA A I 1 1 I CI I A G 
GT ATAAAGCAGATACTAATTA7GCCATCATAGGG 
ATTTCTAAAACACTTAAAAO>GTGAOGACr^ 

TCTATGGGAGCTCIGI I I I CACTCTATTAAA I C I IGCAACTTCACAC1CI ICCAGTCTGT 
Gl I I G I f A TOGCTCAAGCTGAGCI I lOGCICGCnOTCCACCACIGLIGI I PSCTOGGATC 
[G,A] 

CAGACcasccGCTGAcrra^ 

CXAGCGAGGIGGTCCO^TTCCOGC^ IGI ICCTCC 

AOQGGCTAAGTOGOCTOGGTTCATCCTAATTO 

ATCGQGCAAGGTTOG A I ICCI lO GAATCTt^TjAGGQGAAGAAGGGCAGGTCAGAGAAGAA 
GAGGClTGGCm^T^^ 

51789 fiGMOUaOGCTA^ 

AGAAATAGAATSGGAAAC3CTCATGAGGAOCTA G 1 1 I C CI CC I CA GGATCGCrAGOCAOCA 
AAGAAGGAAAAATACTTTTOCCTXSCAG^ 
TAGQCATTCATAGGAOOCATCAGATOGOGAAAT^TTATT^ 
AAACTATCAAGCAGATAGTOVGAGOCT^^ 

[C,T]

AGQCCATCCATTTCAATOCCreAA \ C I 1 1AACCTOU I G I I A A GI I IG I CIC I IACAGAA 
TTGAAGCTGTAAAGCTACAAATGGI ICI I (^AATOCAKXXXACATGCAGTXTATGACTC 
AAATCTACaXBGAOXTT^^ 
AAGGCAGCXXrrcXXEAGGAAATC^ 

AAGCAGTTAGAGCX3GCXIJTTOGOCAACCrcO 1 1 1 ICCIGI I GAGA 
52042 GATACTCAG4GCOCT^^

TTAATTXCTOVATCnTAAGC I CC I FG1 IAAGI I IGIC1CI IAGAGAATTCAAGCTCTAA 
AGCTACAAATOGrnCTTOV^^ 

GAocscrroGAOGQoccroc^^ 

GOGAGGAAATCITAAGTOCATCACCC^^ 

M ___ 
GGCOnTCGCX>WCTtGC^ 

GACAGGACTAGCTGGATTTOC^^ 

ACR^TO^OCTTTAAACAOQQQQ 

TAAAGAGAGCTtACTAAAATGCTMTT^^ 

ATCTATCACOGAGiVGCAGAGGGa 



52139 CTCTTACAGAATTXWVGCT^^ 
GTCTATlttCTCAAATOACG^^ 
TCATCATATOVMGGCACODCT^^ 
AGnOVGQWQGAAGCAGTTAGAGCGQa 
TOCTCTTCAGAGGGGTTCC^^ 
[T,C] 

CXCTAAGOCT AGCTOGGAAGCTGACTGCATCCAC^ 

ClTACACCOGACCAATCAGCTAOrr^^ 

AGGAAGT AAAGAAATAGCCAATCATOATCAGCTCAGA^ 

C^GG^TATAAACXjCAGQOGTTCTAQOQQQCAAO 

TCTATOGGAG C I CIGI I 1 I CA CTCTATTAAATCrnGCAACrGCACAAAAAOCAAAiQCAAA 

53089 GATAAGTCA ATC1 I CA G^GAACTOGOOTCAGC^^ 

CGCTTTCGAGAAI I ICI IAATAI I IGlAAACOCAGCTTTCCrAG 
AGVCXTfXCrGTCCVGCAaasrVG^ 

CTTAGTCCTCTOO IGGOCT AI I I GG I GL I H 1 1 1 1 1 1 MCI I I 1 1 1 1 I A AGACAGGGTC 

I I GU I G A A I U IGUGAGGCTT^TTOWVCTajGG^ 

[A,C] 

GCCTTCXAAGrAGCraGC^^ 

AATC1 I C I I 1 I POG rAGI FOCI PDCTAGCTCI ICIG AGA G I I 1 1 GQ GGCTTATCTATAT 
AAGAAG3V L I PGG1 PGCOO^GGGAGAGAGGATOCAOTAGAGTTAlCA^ 
ATXOOT^AAAQCTCAATTX^TAA^ 
AAGTCTAGAATATTTATO^TOT^ 

53117 TCAGCCAGCTOAC^ACTAACOVGC I C I G I GGC30CTTTOGAGAA I I ICI IAATAI I IGIA 
AACCTCAGCTTraXACra 

AAGGATTTAAGAAJCATCTATAT^ IU50OAI I RGGTC 

C I 1 1 1 II 1 1 I I I C I I I n n rA AGACAGG GTO TOG K5 AATC1 rcXnGAGGCTGGATTC 
AAACTGGGGGGCTCAAGRjATGQ GCTGQCTCAGOC I ICCAAGTAGCTGQGACTACAGGC 

[C,A]

TXSCAGCACTUPQOCraGTOGCAGFGGnOGI IGAAIGI ICI 1 1 1 ICLI lAGrTCCTPOCTA 
G UC I rCTCACAGl I 1 1 G GGGCTTATCTATATAAGAAGGA C I PGG1 PG CCTCAGGGAGAG 
AGGATCCACTAGAGrTAOVTAG 
ACAAAGTXjJ^GGATTTCACXXA^ 

GGCTGACATACTXTTGACXI ICICAGATACI 1 1 ICUJCTOCAI ICJLCI I ICJLI ICICCCT 

53434 GTOGCAG1GUCGI IGA AIGI ICI I M ICC I IAGI ICC I IC CTAG LICI IC I G ACAGTTT 
TQGGGCTTATGTATATAAGAAGGAC I HOGl IGCCICAGGGAGAGAGGATGCAGTAGAGTT 
ACATAGCTX^COOVCATOCTXX^ 
ACCO^TACTTTAOVCAAAGTOrAGAA^ 

CC I I CIG AGATA C I I I ICQ CTCTOttTTOO C I I ICC I ICKXCTCCTOGCI II I I I U 1 1 
[-,T,C] 

I I C I I CI I C I I I I I I I I I I 1 1 1 1 I lA CTCTGAAAAACAAaTATATACAGAATAGrACAA 
AAAQVTAOCFGTATAGrTTGAAGAGrAATTATTAAG^ 

ATOCATGTTACTtXIAAAAGACATCAjOCTTA I ICI 1 1 1 ICI ICI IAAI I II I I ICI I 1 1 IC 
I I ICI I IAT 1 I rG QQQ Cl 1 1 I ICA GATCTAGAlQCTCCAGAG A l C I I C I I CI I NN I G AG 
ACAGCATCnOGa C TGTC A OCAGGCreGAl^^ 

55431 

cro^GccraxAAGTOGna^ 

A l IG I I I 1 1 A ATAGAGGTAAGQGTCTCACT A 1 G I nS OCTAGGOCAGTCTOGAATTOCAGG 
GGCTCAAGGGATCCI I I IGOLI I G l CC I CO CAGAGIGCrCGGATTTAAGrTCGGAGOCAC 

[T,G]

ATAOOCACXXMCATAATTXiAATTATTTMTATTTO 
GQGATXnGATCTrTOGGTGAATAATA^ 
AGTT ATTCTAAAAAATTAATTATTTACCrrTATTT^

TCAAG^TATTTTOCTATTTnAOT^ 

AAATAAAATATAATAGCWVOCAi^^^ 

55905 AGCATAATXIA/WC^TATTTTCGTATT^ 
GATTAAAAMTAAAATATAATAGCV^^ 
CAATACCAGCTQGGTtXX^ATCCCTCAGK 
AGGC«*7O\!0Crcm^ 

CTCTCCZCAAAAATACAAAATAAGTTGQCj IGIGCjI GGTCCATGCCTCTAATXXCAQCrACr 
[C,T] 

TO^CACCATTGCACTTXAGTrTCG^ 
AAACAAAAAACACACTACCATTra 
ATO3«XAAAATAGCTATAAGA^ 
TAAAAC^WXAAATAAATCGACAG^^^ 



60567 TAAATATOVCATTAAAAGCACAAA^^ 

TAAAACxnTarnxscrrowusoGO 

AGATA AI MM 1 G GAAATCATtnATTTTAGAAGACl GGTXrTCCAGAATATATAAAC^ACA 

CTTGCAACTCAGCAATAAAAAQVCA^ 

TAGACATTTTCrrOWVGA^GAC^^^ 

[C,T] 

TCATTOCTOmAGAGAAAT^ 

AAAGATGW^TAACAAATCTTX 

TQGATCGGflATGTAAAATX3G7XX^ 

GCACATOXACAGCAA^ 

OVGQOCXXKKjGCGGTOGCTCAAGCC^ 

60751 CAACn^GCAATAAAAAGACAAGTAA 
CATTTCTTOWVGAAGAO^^^ 
TTOGTCATTAG4G*A^^ 
GATOVACAATAACAAATCTT^^ 
ATCQGAATGrAAAATGCTTOCAG^^^ 
[C,T]

ATCGCACAGCAATTCTACTTTT AGCTCTATAOCCAAGACAATTAAAAAGATATATACAGG 

CCGGGOGOQGTCGCTCAAGCXrTCTMTO^ 

GGTOVa^GATOiAGACCATCaXS^ 

AAA^WVTTAGCTCQGajTCGTC^ 

GGAGGAGA ATC TOCTI j AAOQCX3GGAG^ 



60755 TOGCAATAAAAAGACAAlCTW 

TCTTCAAAGAAC^VCATATAAATCACCAA^ 

TOVTTAGAG^TCCAAATAG*^^ 

AAOVKTAAOWCTXHTOX^ 

GAATCTAAAATtXTTCCAGACACr^ 

[G,A] 

OVCAGCAATTCTACrTTTAQ^ 
GOQOGGTGGCTCAAQCXITGTAATCCCAGCAC^ 
AQGAGAT0G^GACCATCOX3GCTAATACAGTG 
AATTAGCTXXjGOGTTjGTGGGGGGGOG^ 

GAGA ATCT0 G TC AAO0OGGCAGG0GG GGC1 R3 O^jfGAG00GAGATTS0G0CACrQCACr 



63301 I C I I I A CAQVASTTGGG L I GC I CC TCGTC GA GAGTC TCTA ACACn^ 

ACCACI 1 1 I I G CCTOV b l 1 ICI 1 1 A TT3GGAACAAGGAG^TAAG^TACTTCAA*TCAGGC 
CAGQC3Cn«nGACTT>a5a^^ 

D\G] 

TATOCTGQCCTCATATTX^TXXJroCATCCT 

TTTCAGTCAGGAO 0G I C I IC I CIAG I 1 1 CX.I C I G I AQCTATTACCITAGCCriXXIATTT 

CAI IC I I U XCACrACAG^TACTCrOVni^TAAAGGAATT ^ I G 1 C I I IATCGTTTCAAGC 

ATTCTOGCAAlCTTAGrM^ 

AAGA CIGI 1 1 I lA ATCTOnHXrGCTCXaXAA 



64573 (XTCACX^aDCATTACT^^ 

TTCTCAACTWCTAGATACCTXIATAGAA1 ' IGLIGI 1 GATATTAAGTGACTTAATXXTATG 
GQCTGAAI I IXJrunDCGAAACTTCAl I I G I IGGAAACTTAATXXXDCAGTQCAIGIGI IAA 
GAAGTCGG*CXJ I I CAG^GTTCAATAGGCT ATQMGGGL I C RiCCTTOVTCAATCGATTAA 
TOG Ob l IC 1 1 U CAGT AGKX3GTTCCnATAAAAGGAAGTCTTTT^CO00L 1 1 1CCI IGOC
D\A] 

(XTXATCCATGTC^TCGC^ 

CTOQCTaTTC^TCTTOGACm II I MCI 

TTATAAATTAOGCAGTXrTOGCnATTOGrrAT^ 

AATATATATOVVQCATCTAGACT^ 

TATCATOWTATCATCATCATOKTCATC^ 

GCTOSG G I I GblCA G U IU I IGUC I l A TTTTOQCAAAGAAAGAAAGCTCreGGlTAG 
GGGCAQQCTOTTT AOtXTATCACCTQGCAGGATTTGCAGGA^ 
ATATTGATCCAGATTTTTAC^^ 

ATCA C I GGI IGU I LA GAGWVTAAGCAGGGrrAGTCTAAAATCG 
AACAACTAATC^GACTAGAATTT^ 
[T,C] 

CICTCTCC A GTOCTC A I ITTTC1 I AAAAAGAAATCATCATAGGACTCAGT^^ 

ATAAACTTTAGTXriTATATTTQQa^^ 

TTTTCACAOVGGCTm^ 

TAAGAGCTTTTAAAGCTCW^^ 

GrAAATTOCTCTXniCrreAGGn^ 

TOOCATAGCTrATTATAAACCAICl 1 1 1 GAAAAGGATCAAAATAAGACAATTCTCTUTCA 
ATCACAAA AIG I U 1 I U 3CTAATAAO¥nt!AA AGQCATGATTCACAAAGAAA 1 1 PQG1 IA 
TTTCR^VGCTTTACAATAACAACATAATAA 1 1 1 I 1 1 1 1 I I I 1 1 I 1 1 1 1 I 1 1 1 IGAGAGGG 
AGTCTCGCICIGI C GQOCAGGCroGAGroGAGTCGOGGGATC 



<XC3GCCACrAa3O00GGCrA A I I I I 1 1 b I A TTTrrAOTAG^GAOGGGGTTTCAOOgrTTT 
AGGOGGGATGGTOOGATCTCOGACCT^ 



GCATATACTCAGAOTTAGAATT^ 
TCATTAAAATATAAGCTCAAGA^ 

UN I IC^V&VOQGfrGTCroGCICIGI 
TCACTOCAAGCTXjQGOCTaDOGGC^^ 
TO»\CTACAGGOGCCGGGC^ 

GGTTTCAOQGTTTTAGQOGGGATGG I C I ISA 1 C rOCreAOCTOCnGCTCXDGGCiGGC CT^ 
GOCTCCOWWGTOCTGGGATTAC^ 

[G,A,C,T]

ATTAGGOTTGATAGCATATAC^ 
AGATATGTATTTTPCATTAAAATATAAOCTX 

OV\TCQCACATAACT AAAGATGTCAgTTAATCOUTTTAOL IC1CI n IGGATGCTCCAG 

QVQOCCTCTXnAGrATTOWV^ 

GATTTTOGGAAGGCTXnTAAGTACATTAQ^ 

TAG U I I I A T 1 1 1 1 (JL I I CA GAAAATATTTCATCTAAGreCnA 1 1 I I ICICIAAGCCAA 

TTAATTAGAGCTCTTTTTTATA^ 

CAGftGG^TO^OTAGTTCTAAGATTT^^ 

TOVGGfrTOGAGOQCrrCAAGAGQVGQG CT^ 

ATAGTTATAGCTGGAAC^VCAAAAACAG^ I C I GC I 1 I IAATTCTT 

[T,C] 

G GG1 1 IC ATCAGGAAAAGAGAG U 1 1 1 1 1 1 C I A AMTOOGGTCAGTOGTCeCTXirTtllAT 
I II I I Cj CAGQGAGTCCeaG^ 
GAGAGGCAAGAGAAAATOGAGAAAAGTAAT^^ 
AGTCWVACAAG^TGCAAGAAGAGGAAA^^ 

GATATCCACI 1 1 IAATTAAGCTGACI 1 1 1 IACCATAGIGCICI IATTTTAAAAAATCCTT 

ATTGAGGAAAAGAATC/^ATCGGGCTCTTC^ 

TTOGGCACTGAATAAATTTTAAGAAGGCAGT^ 

CRJTTAGTOCTTATTOVVT^ 

AGGTTGAAATTX^CAGrAATrtAC^ ITTCTTTTCnTTCTTTl 
C I MCM II I I I M I I I I 1 1 M l lj AG^CAGAATCTCACTOCTXXX3GGAGGrrGGAGr^ 



GTAGGATCAGCrOGGCTXtt^^ 
TO*GCXTTOCTGAGTAGCraGGATT^ 

MM rA GTAGAGAOGAGGTUr CrO Gfl TCI IGG CCAGQC IGGIC I IGATOCTGACCTCAGG 
TGATCTXXXrrGQGTTCATOCiOCAAAG TCC I G GC^TTAGAGGTCTCAGGCAQIATCOOOG 

GOCAAGGTcrrrrrTCTTXwwvrA^ 





[A,T] 
TO^AGAOGAGCTTCAGTAGC^AAGTGAGACOCT Ij 1 CTCT A GAAA AGAAAGACACAGAAAA
GAAATATGACTGACTAAMTACATATAATTTTC^^ 
AAAATTTCTQQGCTO\AGGTOQGTX^TXXCT 
CAACCrGGCAAAAOCTTXTITTCrACAAAAAOT 

GAC C I G IGGI I C I A GCTACTTTiGAAGATTCAGGTGGGAAA 1 I IGU I GAGCCTGQGCTGr 
[C,T]

I I I ICIOWWWWWWWWWV GGCAAAAAATTAA^ 

GGTTTOGACTTAATATCAAACTC A 1 1 1 C 1 A GAAATGATT^TCATTTCCATAGGGCTTAAC 
TTCCTTTCCTAAGAAAATAGAGTAGTATA 

AGGGTOVTCTACCAAGACAGAGAAI I I O I I G 1 (^TCATCAGTUTTAAACTCTAAATTATT 
ACTCACT AAAATACATATMTTTTOVTAATACrTTAAAATOT 

TOGGCTCAAGGTGGGTGAICGLI nGAAOCIAGGAGrFCAAGAjQCAGQCraGGCAAlQCTGG 
CAAAAOCrTCTTTCTACAAAAAGT ACAAAAATTAQCCAGGCATOCTXSGroCACACCTinG 
GrrcTAGCTACrroGAAGATTGAGGTOGGAAAl I IGU I GAGGCTGGGCTCTCGAGATCA 
OVGreAGCTCAGATTXXACCACTQC^^ 
[A,-] 

AAAAAAAAAAAAAAAAAAGGCAAAAAATTAAATTATTAGTATGCT^ 
CTTAATATGAAACTCA 1 1 I C I AGAAATT^TGATTIATTTOCATAGGGCTTAAL I ICCI I IG 
CTAAGAAAATAGAGrAGTATACTAGG^ 

CTACCAAGACAGACAAI I ICI I G I (ATOVTTIAGTTTITAAACTCr AAATTATTAAGTGCTT 
ATCTGC^GATACIGAAGITTATATAGA C I I I UUAA I U I I A ATAATTCTAGAAAGGT 

AAAAGATTAAACATATCTATGGTTT^^^ 

TATCCAGGTO^GCAAATATTOACT AGTCT ATCAGTCAATTTOIATGGCAAAAGAACTAA 
GCTTAGGGVCTATACTXIAAAAAAACTAAAAATAAAAT^ 

ATQGAATAAATACAAATATAACTTACCATCnrCATAATT^ 1 1 CI 1 1 

TACAGCATGQGrAGbl rOCPGTXA TCGGGATC Al 1 1 PCI 1 1 PGCTO OOCAATAGTCAG 



GTCTTCACAGACCTAI I I GGI IGICGGAAAACACCTGTGGTGGGIGCIGCIGI IGGATTT 

GTTGGGCTOVTUTCCAGI ICI I I IGIAAGGTAAGGACI PGG1 1 1 1 1 PCATC1 K3C1 II 1 1 

AAAAACrcrr/VSATACSCrrAAAGrTTTACTT^ 

TOCTOU I I ICAI 1 1 1 I lAAAACnTAAGCMTATGACTTATA^ 

TAGCAGC^AATXAGAGCrnGAGAATTTCAATCL 1 1 I I II I ICI I G I AATCOCTAAGACTT 

ACTT ATCTAATCTACT ACXT AGTTTCTT AACAAAACACACATAGAAAGCAA I G 1 1 1 I PGA 
AAI I I I I C I GACCftGTGAGCAATAAAAATTATC^CATATA 1 1 1 IGATCTGACXCAGTTCT 
GIC1C I C I I I C I C I A eXXTTCTAAGRjAAACAAAATTTATTCAAAOCAAAA 
AGATGTAATATTOTIATATATTCTATTAAA 1 1 I CGI IATTTAGO IGLIGATCAAAGGCT 
ACTCAAACTTCAGAGCAAGAT^^ 



AGTGGCAGGTOC AI M i l fGCTCTA GCTOCA Gl ICIGCJLI PCTC AGGAAAA0C1 ICIOCA 

ACAACTTAGGTCAATCACAOGCATGTXIXjCI ICICIGAATCLI 1 1 1 1 GCAGATATCATPGG 

TATmSACAGGGTACTWTTAC^^ 

AACCATCTtalTACrcAAGGGAAAAOCTTGGAAG^ 

CD\TGATAAAGCTCATAT^^ 

I 1 1 1 I I II I I I I I A AATAGAGATCG GG 1 1 I I GC T A I GI IGQCAGACTGGTCrCA/VGGCAT 
CCTOCroOCTinQGOCACXDCAAAGTinTOGGATT^ 

AGGAOCAGA1 1 1 1 IAATAI ICI I H CJCAGAATXnATOGGTACACAGTAG I IGLI IAATA 
TCrrOGCTAAACAAAGAGTOGAGATTCAGTAAAGGGTW 

TTOGGAAAG0CTAGAAGTC A1 I C I I G AGGCTCA7TTCAAG G I GGTCG rAGCRjTCGATTA 
[G,A,T,C]

TAWGGGAGAAGGGCATCTCAGAGAGAGGATTGG^^ 

TAGA G I I CCI I ATGATTAGCPCAGCAI G 1 1 G CTAGACTAGCATTATTATCXIAAMTTTTA 
ATTATTAAGGAACfTTAATCTTA CI I PCTA ACAA A1 IGI I IGL I 1 1 I A CTACTCATAGQC 
I I \ ICA AAAAACrTTAAOA GI I 1 1A I ICC I IACCATA AI IGI 1 I C AAAGAAGATAATCA 
TATCATCCTTT A I C I I CjCTAAGAAATCTCCAATTAI I PGG1 I A AACTCTAAGATTATTTA 

GanTACAGCTTACAAAOGGGA 

GTCAGAATATCAI I ICI IGI IGICI IGCACAATACTGAGAACAGTGCAGrACAGGGGGAA 
G GI I GG I CIA GOGOGCrrAGGOCAGCAAAAA 

TOGrGACATAAGCnGGGGAAAAAATATAAAATTSCGGG ^ 1 1 \ I CITI l A CTAO L i I G I 1 1 
GGTAAGTAOCTOGAAAAACTOCATCAAATMTTAGA^ 

[T,A]

AAAAAAIGI I PCATTGAGACTAGG1 1 1 1 I GGI I IGI IAATTCAATCACIGI IGATTTTAC 
CCI I CC I G GCAGGAAGCTTT A I I I C I G ^GCrGTGGAQVGCACA GI ICI CA CTCAGTCCnS 
GGTTATTTntttCATAT^

CTGA GI 1 1 1 KJLC TCT A I 1 1 R3G1 F AA1 1 1 1 1 i ll 1 1 1 1 I I U»I l A ACrnTACTTOTAG 
CTACATOCR^TOT^TACT^ 

AAGGTAAG1 I I LC I U iAATATAG A I CIG I I G 1 GACAGGATTAACTTTACCATCCAACCrTZ 
TT1TATAGCI ICIGIAGTOttGAGAACATTTAI ICIGICjCI I ICI IAAAAAGATGAGTAG 

aa a i ncriTTTCTTTTTrncrrrn po cagagagg gtci psi va agi pqctcaggctqg 



AAAAAAAAAI I UU 1 1 GGTACTACCAGATGAACACAC^^ 

TKCTAATACTAGAC^TTTAOCAAA^ 

TTATATATTTATAATTTGTAMTTAATTT^ 

AAAGCAGTAAAGCAGATAAAATAAATAT^^ 

TATTGTrAATAGAGAOOTTAAGTCATT^ 

GCATAAAAATOGG^GTrATTTTAATCTAAGGCA^ 

AAATTTTT^WVGGAGAAAGATAATAOOWTTTXSGAAAAG I GU I I I IAAAAAGTTTCATAG 
CATTCGAGTTGGGGCTPGAGCATGAGAI 1 1 IGIGIACAAATCPGAICI I IGATCAACTAG 
GGAACTAACTTACCAGmAGGTCT^^ 
GCTATGTTAAAAATTCrAAGATC^^ 

[T,C]

ATC C1 PGG I PCAAAGTGAGG1 PGGAGG1 1 PGATC1 rCFOGTASTPGAGjI I ICAAAAAG 
AAGAATTAGATTOOC I C OTI G AAGCTAAATTTAQ C 1 1 I CI 1 1 I A GGOGTTCOCACTTAAA 
ATCrTTTTTAGAAGGATAGAAATCTTA 

AAOGATTCCTAGTAGCAGCTGCATCAG 1 1 I I IA7GAATPGGG0CI 1 1 IGLCIGAGAGTTG 

rTTTCrrrrcrn i ug ga a i c i i m m i gm mgi i mgm i pgci rrcrm psi n i 

TCQGAGAGAATATCTATGCI IGCI I ICAAAGTGAGGTPGGAGCTTPGAICI ILfXDCTAOT 
TCAC3C3TTTC yWW\ G AA GAATTAGATTCGCro I IICI U I AG 

GCCnaXACTTAAAAILI I I II I AGAAGGATACAAATCTTATAGATCAATTTAGATGAG 
GCCTAACI I I C I A AAAAO^TTCCTAGrAGCAGCTX3CAT^ G 1 1 1 1 I A TCAATPQQ00GT 
TTTOQCTGAGA G1 PSI rTTCTTTTCTn PCTGGAATG rTTTTTOTTTTCTITTGTTTTC 

[C,T,G,A] 

rTTGTTTTTCrn PQS1 1 1 1 1 1 1 I PSAGAQQGAGflCl PGCTOGTCTCOCAGGCTGGAGT 

GCAGTGGTGCAATOCXGGCTOV^^ 

CeTOWCTTXCTAGTAG^ 

7CT A1 1 1 1 rA GTAGAGACAGQ SI 1 1 rG OCACATTGGQCAGGCRStnOOOGAACPDCnaAC 
CTOVOGU^TOCAOOCATOT^^ 

CAGGCTCGAGTGCAGTOGTGC^^ 
TT^TTCTCOGOCTCAACCIT^ 

ACTTA A 1 I I 1 1 PSTA1 1 1 1 lA GTAGAGACAQQ G I 1 1 1G 0CACATTGGGCAGGCT G GTT 3 X 
GAACFGCTGAGCTOVGGTGATCCAOQCAICI IGQGOXrCAAAATQCTGQGATTAOQQGr 
GTGfrGOCACCAOGOCTGGQCTOGCT I IC I M 1 1 1 II 1 1 1 I I I 1 1 I I 1 1 1 II I II 1 1 1 1 

[T,C,A] 

Aa3GCI(XICICA C I CC I CTC ATTTAG LI I I CA GGAGGATAAA L 1 LILI PGG1 1 I PCTOC 
CTACCTrX^CATCALI<JLI(JLI I AG 1 1 I LI I I GL I L A CTTCI I L I 1 1 1 ILC CAGTGAOQC 
CPGAATATCAGCA I G I OCTAQGGC I IGIOOGCTCAILI I I I I L I LCATGTATTCTACPGG 
TGCHTTCATOGAGTCTOCTAAGrP 
AATTOGGTCCAGVCOTTTCXX^ 

QDGAACTtXnX^CCTO^ 

GICTGAGGGAOGAGS0CR3GGCTCT SSG1 I I L I 1 1 1 1 1 1 II 1 1 1 1 I II I I 1 1 1 1 ML1 II 
TTAACGGC I GC I C I GAL I (XTCi CATTTAGCTTTCAGGAGCATAAAC I C I C I IC^il 1 1 IC 
ltZttACCKCACAJXy^\lJL\UL\ I AG 1 1 ILI I IGLILALI ICI ICI 1 1 I ILCCACTGA 
COOCTGAATATCAGCATCroCTAGGG C I I G I CC GCTG A I CI 1 1 1 1 C I (JCATCTATTOAC 

[T,-] 

GCTGtmTCATO^VnOCCTAA^ 

ATA A IT OGG I IX AGAO CI 1 1 1 O OCTGAATCOtXACCAGAGCTCTATATCCAGC I GC 1 1 
AOTAACATCTCCALI P SGOTA ACTGCrAG GTGI I ICAGACTTAOCOGTCTAAaXTGA 
G d l C I I GATOTAGQ0CTTAAAACTTAC1OGCCQ0CAQ 
GCAATTaXXXXTTTOVSTPGAT^ 

I PPG ILILIUXOCGCGGAGI I I C I IL I L I A TCAAGAAGOCACAGGC A I I L I 1 1 L I A AA 
CATAAGmCAL I L I GCTONGAATOCTTCAATO 
ATATCCTTACAGTGAlCCTACAAGGTOCr^ 

TOCATCGCTGRXCI ICOJLACICPGLI ILICO CATTOG CC I I I IA ATGGGGCPGACTCT 
GACTACCIGU I G AAA C I nOCreOGTOOCl 1 1' I O OOCTXjACTATTCACAAAOOGCTCXTA
[G,T]

TAU(JU mUTTTTl Id I AGCACTTAATAC I I I C I AACATTATCTATTTTAC I ICI 1 1 

ATTCTAGTCATTGCTTACTATX^^ 

GGCnAAGgVTCTATnOVT^ 

CACnK3CTGCTO\AATATTTCr^ 

ACTOTATAAGTAATGriTATTTTTTCATC 

97814 TAOCTCTCCG^GCTrOCATCQ 

ATOGGGCTCACTCTGACTAG L I GC I IGA AACI ICjLIUJOr O CXTnTaXOG^ATTC 
ACAAAOGGCroCTAOTA C I OC I I MC I I I 1 1 1 lb I AGCACTTAATAL 1 i ICIAACATTAT 
CTATTTTAC I I LI I IA TTCTAGTC A I I GCI l A CTATOCCTATATTTACAQnxrTCCTAG^ 
ATCTAAACAattCAAGGGTA^ 

[A,G] 

OVCTGCCTAGCACACACTOGGT 

GAGTCT&VVGTGAlGACTGTATAAGTAA I G I ICAI 1 1 1 I I CATCATTTCGAI C I I IAAAAT 
TCTOACTnG\TX3CTATAATCAI 1 1 1 1 CAGATTCTGTACT7GCAGGACATCG I G I I ATT 
AATATTTATTOyVTACTTATTCAACA^ 

98430 GAAATTTTAAGATAAATAGAAGAAAI I ICIGGICLCICAAGTAACIGIGICI PCAGTAOC 
0OGAAAAATCTCAAA(yVG1OGGAblGCjlGIGI I I AAGAATAGGATGCAGGATGCAGA 
ACXATAACCAGQCCTOVGGTCnGCATAGC 1 1 1 GG I (JGAGCATTGAGCATAGGGCCTCGTG 
AC^TAACTGATAAATGCCAAATATCACAATGATAA^ 

GG&MGAATCAOVGTTaAOV\TCATO 1 1 I ICICAT 

[C,T] 

GQCATC^WVTGCTCTATCTC 

AOCATATXATTOGGTOWVrTTCAG 1 1 1 I I AAATTTTAGATOCAGATGG I ICICIGICAA 
GAAGATCAOGACTCATATTC^ UN I AAA 

AACTCXXrrrATAAGTGGOTTt^ L I I I G I G GCAGATAGTMTTGA C IGI ICTCAAAAGAAA 

101268 CI IGIA GTO\CTAAGTCTAAGGATCATAG^^ 

TCATGCTATCAGGAAGG G 1 C 1 1 lAGCAGCCA^ACAAAGnCTAGGGGCTGTACATTGCTG 
ATCTGTTAACCCTOVGC^^ 

TATCGCTATCATTATATCTCPGGC I C I I I IG CGG UU C ICU I G GGCAGCTTALI IGIA 
ATTAGAAAGrrTATATTCECTOVrAAGA^ 

[A,G] 

OVGCAAATTOGAGGTinATAATTlXAAA I (_ I G I IATGCAATOC 

CTUTATCTOOTATOCATCTTAAATnGAACT 

CAGTOGCAACATCTA GICI I CA GAGCATAGTTTAAG ft \ 1 1 ' l " l G QQCAATGCPDGAAO0CA 
TOGAATC GIGIGC 1 1 I G AAAAQGACA GG I I I CI 1 1 1 A GAONAATACAACATrTATTTCQC 
GC AI MCI II 1 1 G ATTTAAGATTTTAGTTAAC A 1 1 1 1 1 ATTAACATTTTAGTCrACAAGA 

103881 ATOOTAAGAGATOCHAAGAGAOXAC^

ATAGTAACTAC A I OC 1 1 I GGGTAAAGGGT AGTATAAGGT ACTTTCAAGCTACAAAAATAA 
GACAG C I I I C I A TTCC CC I I G GGAGQCC TATAACA GA A I I IC1CAAGTCTCTAAGG0CAA 
TCAAGAGTTCGAI 1 1 1 1 I IATOCAACTTAI 1 1 1 1 AATTCATCTATTATTAAAAATCTGCA 
TATCAAAAATGAAAA IG I C I IGCATACI I IGCIGIAGGAGCCAATCAI IGM I II ICI IC 

[A,G] 

TATACTOCATTAATCTCTTTT^ 

AGGAAGAAAAAGAGGTTTAAT^QGACTTAAAGlTCCAjCATGG 

ATOGTX3GAA^TOGAGGAGGATCAAAGGCATC ICI IACATGGTOGCAGGCAGGGGAGTAT 

GTOCAGGGGAACraCCCTTT^^ 

AGAATAGCACAAGAAAAACCTinOGOCATT^T^ 

103926 CTOVTTGAAAAGCTIOTAGTAACT 

AAGCTACAAAAATAAGACAGCTTTCTAT^^ 

AGTOXrrAAGQOCAATCAAGAjOTTGGAI I I 1 1 1 1 ATCCAACTTA 1 1 1 1 I AATTGATCTAT 
TATTAAAAATCTGCATATOVAAAATGAAAATCT^^ 

C A I IGM 1 1 I ICI I CA TATACTCCATTA A I C I G I I 1 1 CA CACTGCTAATAAAGACTTACC 
[C,T] 

GAGACCAGGTAATITAGGAAGAAAAAGAGG^ 

CTAGGCTTXACAGTCATCGTGGAAGATC IACATGGTGG 

CAGGOWGGGGAGTAlTnGCAGQGGAACTGGCCTTrA^ 

TTATTXTVCTCTTACGAGAATAGCACAA^^ 

A0CAG C I I GC I O OCATCATATCTGGGGATTATOGGAGCTACAATTt^ 
107845 AOL I l UUOl l IO V U I IGlCGCrACI I I T T CTCA GA TTTTTT TOVAT^^

rTTTCTTCTTTTn fCTAGCl 1 1 1 C 1 AACSCrrGCATTTATTTTCCrnZAGATCT^ 
CTGCTGCAATTPGACJGI I CAGGTT (4^T<^XIAArrrCCCXlfiJTTAJCAi3 1 I I IGIGGCTT 
AGGCAGTATTCAGGAGCATT^^ 
ACTTCTITACTCATT^TOGGC^OVr^ 

[CM"] 

nSl MM PG 1 I I (JVAAATATAATATACrAQCATAQGAACTTGACAGAAGAG 
ACAGAAGAAATCTAGAGAACTGATCATX^^ 

TTATAGT AAGGTAGAOCAAG I I IGICLIGIGI I tXAAATTATAClTAGGCAAAAATAAAT 

ATTTATAGATMTTGAATACTAGTrT^ 

ATTATOCCrGTAATCTAGGGCCCAAACr^ 

109010 AGGTTTAAAOTTAAATAATAGAAATA^ 
ATtnGAAATAACTTAAAGGTATOGAAA 

GTAAAATAATOGGGAG I QC 1 1 IGIGUU IUUXM/JGUGU I LCAGAAGAAGGGGGT 

GTCrWGTCTOVACATCACOCAACAAGr^ 

ACATACATATOVGTOQGATTTAGAA 

[-,T]

ATAATATCCTCAATAATAAACCAAGTir^ 

AAGAG ATC1 1 1 1 CI I I IA TATG UCjU l A CTOCTCAGGrGCrCTOCAGGGACTTTTCAG 
TTCCTGAQ LI IGICLI I I ILA GC A I I I I CnZA GAGGAGAATTClTAGC n 
CCTCAAGCATTA A 1 I GL I I M I I L I G COVGAT AI 1 1 CJL I IGCTAGGOCTTCflGCXCTCA 
GrfVGCTTJITCrC^TTATCCA 

109623 CXI PGTC AGGCA C1 ICTOVACTOGCAILUJ I A TACCrTCACAGCAACCCTTjTtn"ACOC 
AAAGCAGTGTXAXACTOGGIGLLICjLI I I ICICICJLI tAAATAAMTTCCTAGATAAGAA 
GAOCTCT ATATTOCAGGQ C I IG I LI I I G ATTTTAGGG AAAA A A A A GAAAAGrACCTATAT 
ACATAATCTTTTTAAAAATCAGrAATCrc^ 
GTAAGTTAATOCraGGATACGT^ 
[G,A,C,T]

Tl^WVTACTXX^QGCACAAAAGCAOGAATACT 

GAGAAGTCAGVTTCATAAATAC^^ 

GAGGAGTTATTTAATOMCAGAGAGT^^ 

CAATCTAAATXHACTTAATATAGTACACTT AAAATCGTTAAAATGGGAAATTTTATCAAA 
TAGGWTTATCAO^TAAAAAATTAAAAAOT 

110188 TAAAAAGTAAGAAAAGnACTGCTTOGGCGAA^^ 

CACAAATTTCCAAAACAACjGCTAATGAGG I G I I GL I GCOAA ATGGTG AACCAAATTCTG 
AAGCAATCTGTACTGTITCAGACrQGGAAACTGAT^ 

AGTAGAGTrOVTAATTT^CTCCAAAGAGA I I ICI I I LCOACCATCCCAAGGCCATCPG 
ATTGGCAGTCCAAAGAA G I 1 1 I C I L l C I GCIC TCT A GGCTGOQTAATGCAGAGrACACA 
[AJ.C^] 

GOCTTOC A I 1 1 ICI lATOTjnXTTXTAOCAGGGIGfGblCCI 1 1 ICLIULIGAACACTG 
ACTCTATAATTAOCAC^CAAAACrAAAC^ 

CAATAAAAAAACAGTGGCCTGGGGAGCATAG^ 

ATATTAGCTGAGCATTCCAGCAC I I PGGGAGGCTGAGGGAGGGAGATCAlGCKjAGGrGAG 

111006 AAAAATAAAAATAAATAAAAATTAGCTGAGCATA^^ 
CTTAGGGGCTTCAGOT3QCAGTGA^^ 
AGCGAGACCQ^TCTCAAAACAAAAACMTA^ 
AGQCTOGGCTCACTOGCTTATCCCTCT^ 
ATTGCTTCnGCTl^GGAGTTrTAGATCAG 

[C,T,A] 

AACAACAACAACAGGAGACTATACn^^ 

TAGAGAA GI I fCTCTGTCl A GAGCATTGAAATATAAAMTGCAGAATAAT^TTTACATA 

GCATTTACATTCrATTQGTTATTATAAGTAATC^ 

ATATACATAGGTTACATCC^ 

GATAOGGGTATCTGAGGAG GIGI PGQG1 PCAG1 fCTO CAOGGATAOCAAGAGACTAATCT 

111223 CTTTGAGAGGOCAAGGroGGAGG A I I GLI I GIGL I CA GGAGTTTTAGATCAGQCTGGGTA 
ACAOSGOWSAOCACATCTCrAGAAACAA^ 

1 1 PCTOGGGATCATAGI I I GI I A CTAGAGAA G I I rCTCT G TC I A GAGCATTGAAATATAA 
AAATCCAGAATAATOVTTTACATA^ 
AGATTAATTAAAGTATACAGGAGGATAT^ 

[A,G]

TATAGGGGACTTWTCATCCATAGAT^ 

OVQGGATACXAAGAGACrAATCTrAATT^ 
AATAAGAATAA&WttGG^

ACAGTOGAAGGAAAGCTG^TCT^ 

AACAAATTAG^TTTATOGGTt^ 

111457 ATCTAGAGATTAATTAAAGTATA^ 
G^TTTTATATAGGGGACTTT^^ 
AGTTCRXACGG^TACCAAG^GACTAAT^ 
ACTCTCAAATAAG^TAAOWVAGGA^^ 
AATCATAAGAGTO0VAGGA*AGC^ 
[T,C]

OCT AAAAACAAATTACArrTATOGCrrG^ 

ATTAATAAAAAAAGACTCAACATOVTOGCT^^ 

AGACTTTOQCAAAGTAGOVCrrTAQOCAACA 

TTAG^AGAAAQGTAGAAI I 1 1 CATOGAAGGATGCA 1 1 1 1 ICI I CACTTCAGAATTAAGQG 
AAAAATTAGG^AOCTGAATAAGAACTAATGGOCTAA 1 1 PCI I RSI I ICI I PCAAAAATCA 

112168 CAAGTT ftCCTOV^TOG^GT^TTTAAATAATCL 1 1 ICIGAATCTATCCI 1 1 ICIGI 1 1 
TAAGAAGAAGCTCTATTAGCjI ICI HGTOl rQCrATAAAGAAATAQCRiAGGGQGGATXjAT 
TTATAAAGAAAAGAGCTTTAATTOGC^ 
CCAGCATCTGCrCAbl 1 I lUGUGQGGTCrCGGQC^^ 
AGTOGAAGCAGCAGCTO^Cr^ 
[T,C] 

ACKTTAACX]CAATOtTAGTC^^ 

GCTAAGCDVCninGATCGATCW 

TOCAACATTOGC^TOVC^^ 

TO^GAAGCCTATGTAQQCTCQQCACXSOTQQCT 

QQC3C3GAQQOVQQCAGCTCATTTG^ 

112653 AAQCCTATGTAQQCroQQCAOQCn^ 
QVGGCAGGCAGATCATTTCAGOT 
arATCTCTACTAAAAATAO^^ 

GCTACTCGGGAGGCTCAGGCAGGAAGCTOC1 l(aAACDCXnT3QGCAGAGGTTGCAGTGAA 
O^AGATTO^CCACroO^CTa^ 

[G,-]

AGAAGOGTATATrAAACrrATAAAATTTAATATW 

A l I PS1 ICACI I I GG ACT A I I I I i(X AAATTCATGTACACTC G TX 5 CAICILI IAACAATC 
AGGATAIGI I C I GAGAAATCCATCCTTAGGCAATCTCA I IGI IGIGCAAACATCATAGAG 
TCTACTTAGACAAOOCTA^^ 
QVGCrr A T TC CTOCrAGGC^^ 

114155 TAAAAATTTAMTAAACTAACAGATTAAACiCrTTC 

I i I IGATAACI I G I C^WVTATCT^GCACAG^AATTATTTAGAGCCACT 
ATTAGCTAAAAATCCTAGTC^TTTATT^ 
TAAAATTXttGGGTTATTTAAACTGGGAT^ 
TCTAGGACATTTTACTAAATCn^ 
[-,A,T]

rTCTTI PS1 I fCTM TOCAGGGftAATT AAAAAAA AAAA A AAACAACTAGAAATAAAGGCT 
7TTAAAGAAI 1 1 1 IAGAGTTAGAAAIGI 1 1 1 CAAAATTAGCTTCTTT AAAGCATTAGCCA 
I C I C 1 I 0 CI ICIG AA C I CI ICI I I 1 1 1 C I GaXTTTCCTAGCTATXW^ 
OCAGAAAC1 ICI 1 1 1 I nOCOVSTCCl 1 1 1 1CAIGICI I AACA GTCCX ATOCATCAJTATC 
TACAO^TOGAAAGOC^^ 

114181 AAAOCrnOVrrinTCAQGAAQGAAJCAGAACAAT^ 
AGVGVVATTATTTAGAGCC^ 
TTTCTATTTTAAGATCMCTCTA^ 

ATCTAAATOCAAG L I Gb 1 1 1 VGGl I GG I M 'r TCC TCT A GGACATTTTACTAAATCTTGAT 
CTT ATCTAAATCATOCTATCTXArAGA'nGGA L I G I rTCTTI TGI I IGIAATCCAGGGAAA 
[-,T] 

T A A AAAAA AA AAAAAA CAAGTAlCyWVTAAAGGCrrTTAAAGAA 
G l I I I CA AAATTAG U I RG I I A AACCATTAGCX AIC I C I ICC I I CIG AACICI ICI 1 1 1 1 
TOGCGCTTTOGXAGCTATCAAA^ I ICI 1 1 1 1 IOCCAGTCCT 

1 1 1 ICA I G I C I I A ACAGTCCCATCCATCATrATCr^^ 
AATOGAAAG AIC I GCIG I 1 1 AAAAAAAGAAAGAAQCAATCACCCATCQCTC 

1141S3 AanTTCATTOTTOVQGAAGGAA^ 
AGAAATTATTTAGaGGO^ 

TOATTTTAAGATCAAGTOACTTTAAAC 1 1 C I AAAATCCAGGGTTATTTAAACTGGCAT 
CTAAATCCAAGLIUjI 1 1 I GG I I GGTAATTCC I C I AGGACATTTTACTAAATCTTGATCT 
TATCTAAATCATCCTATCFGATAGATGGAC I G 1 I III 1 1 PG1 I I 1 1 AATOCAGGGAAATT
[A,T] 

AAAAAAAAAAAAAACAAGTAGAAATAAAGGCTTTT AAAGAATTTTTAG^GTTAGAAATCT 
TTTCAAAATTAQbl 1 1 1 ITAAACXATTAGCCATl I I 1 1<XI l lllAA lll l I I I 1 1 I I IL 
TGGCCTTTGGrAGCTATGAAATAATOGCAT^^ I III 1 1 1 I TOOCAGTCCl 1 1 

TTOVTTnCTTAACAGTGCCA^ 
TGC^AAGATCTGCTGTrTAAAAAAACAAACAACO^ 



115964 CTAGAQQGbl nSI 1 1 I b CTC Ab l 1 1 1 G CCTAGAAGACAGrTCTACnTATTTTAGTCCSCAC 
AGTOOACTCO O0CTX3QGCL I I 1 1 1 1 I IC 30 UG C I I I ll l G GCCTCTCTATAQOGOCT 

I HG1 I A QCA C T I FCTA GI I G I IAGTO\GCTGTTACT10CAAGTOT 

TTA l l I I I II I l (j AAGAG!CnXACTAA7lX m,l I II I 1 1 I AAACTATGACTCCCGAAA 

GATATATAGTOAAl I IllGI GCACOVGTATTTTATCATATTTATTTAA 

[A,C] 

AOCTTACATAAAAG^TATTAAACAAAAAGTAC^ 
AGCATOVVGGCATOGGCAAAT^^ 

AOL 1 1 1 1 1 CATTAATCQCA 1 11 1 IGLI 1 1 G CAG G I G l Gr A GCAeXACTGGACTTCACAGG 

I I IIIIIA AATAAAGTCATTTCGG GI I III I l l l l l PGAGCA1 III IIIATOGTGAAAOC 
A l I reGTGATCAGMGCTTGAGI I PCTA ATSC ATCI TOG1 rG GGTTATTOGAGCR3a I 

118100 GTGGTOGaXATOCACrrTAGGAAGOC^^ 

GAGACCAGTOGGOCAAll I I (jAAACTTCATCTCTACr AAAAATACAAAAATTAGCTG 

GGCGTGCTGGTOCACACCTCTAATC 

TCAACOVGGrAQGTGGAGGTTGOVGTT^ 

QjACAGCGAAAOOOCATCTOVAATAMTAAATAAATAAATAAATAAAT^ 

[-,A,G] 

COJL I 1 1 1 AGrrGAAAAACTAAGTTOCTAQrrAAGCATAATTTG 

I I II I I I lA AAATACXTTOVAACATTTONiOCTTATT A I I I I I 1 1 l A AGGATTACAAAGrAG 

AGCAGGGGGGAAATAATAAAOCACTAATAAAGAATMTA^ 

CTTAGTTTCATAAAAAAAAAGATrrc 

G UL I ll I AI I 1 1 PG1 1 I I I IG GAGACAGGnXIATTTTCAACOAAC llUjIijI I IIIA GG 

119631 GTAGGrAATAAAGGTACAAAGATAGATTAO 

TCrAAGGAGTGTAAATGTAGGCACTGGGGCACCTTTGAATAAG 
GQCATATrrAQGAQGATTATTCTGG^C^ 
TMTAGTTTTGAAAG^GAAGAGAAAAAGG^^ 
AAAATCCAAAATXSAGAAGTGTr™ 
[A,G]

TOGGOVCTAT A TCQGT G GTTCT A 1 1 IGb l GU GAA A l I IU.I I G CAGACTPOGAGAACrCA 

GAGCAATCTGTGGTGraGGGGAOSGT^^ 

ATTXTCACTQGCTATCrATCGCCCA^ 

TO^GGAGAAAQGTCCrrTAAAAAAAAGAAAGCT AAAATTAATACCTC^TTCTTATTAAC 
TCTCTGOCAAACA l l l l PC TAAG CTG 1 1 ACACAG^TTnATTTAATTXTOQC^C^ 

120833 CAGOrrcCXXi^AGCTOGGATTAC^ I 1 1 1 IATAT 

TTTTAGrAGAGATCAGGrrTCACCAGGTTGGCCA^ 
GTGATCCACXX^GCTXDGAOCKXXAAAA^ 

GGCCTT AAAAATGCTTTTAAAAATCAAAACTAAAACATCTTAA 1 1 I 1 1 1 IIAAATGTTTT 
CATGAAAATTATOVCAGGACAAGT^^ 

[T,C] 

ATAACATTGOK^GAAGCAA^ 1 I I I III 

ACCATATTO ttj I GTOG I I I II I l l l ll I I G GOCTATAGATGAAACATCrAAATGAA AGAT 
TTCAAGA7GAAAAAAATAAAGAGG1 III 1 1 1 CATGTGCATTXaGOGTXIACrTCAQGAGrnG 
GACGAlllllll 1 1 GGGCQGATTOCAGATTATGTO01 I H 1 1 1 IAAGAAGGTTTATCTAC 
AGGTAll 1 1 1 1 IA CA IU M i l I Cj COTATOWWVTTACTCT^ 

121125 1TX*AGCCTATAACATTGCA^ 

A G 1 1 1 1 PCTACCATATTCAGKaTGG! I III I llll l I nQGCCTATAGATGAAACATOTAA 
ATCAAAGATTTCAAGATCAAAAAAATAAAGAG b I I II I I I IA TCTGC A TTCGQGTTC A CTT 
CAGGAGTTGGACGAUUUll I I WiOOGGATTGCACVVTTATGTGGL I bb I GPGAAGAAGG 
TTTATCTACAGGTA l I Mil IA CA Q.1 M il I I G0CTATCAAAAATTAGTOXATCACGC 

[A,G]

ATGTCIXIATTAAATGrACrT ACATGCTTAAA I III V I 1 1 1 1 1 III I 1 11 1 1 II UN II I 

GAGATCGAGrnU3atTTATT^^ 

GCAACCTGCACOTXO^ 

TTACAGGOGTUrGGCA^^ 

TCTGCAIGI IGGDCAGGllOblll rGAACrOCTGAOCrcAGGTGATCnGQGGGGCTOGGC 
121245



121521 



124296 



124549 



124858 



125920 



ATCAAAQ\TTTOVAGATCAAAAAAATAAAGA^ I RSI ICICATX^XXATTOGCGTCACfT 
CAOG^^Gm^VCGACT G a c i 1 l\ 3 GO0GG^TTXXIAGATTATGrooCrGG I G fGAAlC^AGG 
TTTA7CTAQVGGTAC 1 Mil TACAOLI I I I I I CJGOCTATCAAAA^TTACTCTXZATOVC^ 
AATGrCTCATTAAATCTACrTAOVTOCrTAA A I I C I 1 1 1 1 1 1 I I CI I I CCI I I MCI 1 1 1 
TCAGATCGAGTTTOGCTCIT^ 



GCAACCTCCACCTTXGSGGTTt^^ 

TTACAQQCXjTGroOCACCAC^a^GQCTAA I I 1 I IGIAI I I I I I lAGTAGAG^TCGGGTT 
TCTCC AIG I IG UCA QGCreblCI IG^ICClGQCCTavfiGra^ 
CTCOCAAAGTOGCTTAAA"! ICI ICI ATA^AAATC^^AATATTTTOACAACATAACrTC 
TATAGQCA b l 1 1 I l U ttGQVCAAMTTA^ 

TGCAATO3CAOS*TCTt^^ 

GCCTTAGCCraX^ I 1 1 1 

GT A1 II 1 1 1 rAgTAGAGATCGGGI I RCTOCATCl lUjICA OG C I U^IC ri G AACTCCreV 

OCTOVGCnT^lCTXiCmX^ 

G4AATAI I I ICIACAACATAACnxnATAQGCAGm 

[G,A]

TTTOG b l I I rA AACAlT^GAAATTOGCAATTj^A ACAACA A l I IC I I I CI 1 1 FUTOgTOGA 
ACFGCAOCAAAOCAGAATOCjI 1 1 ICATOCAI IGCI 1 1 1 fCTATCAAGAATGl I I II ICGT 
GTACTTCK^TAGTOVTCTXS^ 

TCTCJTGTGACI 1 1 ICAGCI ICI I IACTOCAAATTCGCICCICGIGI I I KdQGGTGAGCAT 
AAACAAATCCTAATTCCAAG^TC^^ 

GCATCAAGTAACTGGAG^^ 

AAOVQGGATCAGAGTG1 1 I CrAGGGGOVGGAGTGGT^UQCAGTGTCAGrAGCIAGTGAin 

AAGAGGCTGAAAGCTT^^ 

AATOCTTTTnACAGGT^^ 

AA C1 1 IGlCCAAATCCTTCGaCT^ 



AQCjTTTTCAQGATTAAGOCRJTCr ^TTATATTATACATtyVGAQQCAGTATTTXaAGCAffi 

GOCTTG^WWVGAGGTCAAATTTtSGACACAGGAAMTC^^ 

AAGTrAACrrTQCATTMTTCACri^^ 

ATTnTX3CAGACAATO\A^^ 

CCTITAAGA&VSAAG^ 

TCTTOC£linT&U^ 1 1 PGAGGA 

TTAAGCCTGTCTAATTATATTATACATCAGAQQCA 

ATTAATTG^CnTWVCTAATAAG^^ 
A7X*ACTAAGGGCTCT^ 
[G,A] 

AQOTACX^TTX^ATTra^CrTA^ 
CTUTOQCnCTOXrrUKnATAG^^ 

AGAAGGAAGTGAAGAGAG1 RGCTAATCTGCTOC1 1 1 rAAAAATCTATTTATTCTATCATC 
ATOV U I I C I CA GrACnOVCTCTEG^ I I I C I IC I C I A G 

I I C I I A TTTCQVCTTAGAGATCTTWVC^ 
GTCTGKrrOT A TAG^QGPGCAQGCAGrCAOV^ 

GFSAAGAGAGTPQCTAATCPGCnGO 1 1 1 AAAMTCrATTTATTOTATCATXyVTCACnT 

GTO>OTACrTO\CTUR3GAC^^ 

GACCCCTCTAATCGACATTTA 

[G,T] 

AGTTdCCrcCTGrcGGOCAAG^ 

AGACAGAG0CTTCAAAG0CAGGATGAAGAGA O I ICC 1 1 l A AGAGTAGTOGATTTTOGQC 
GGQCGCAGroGTTOV0QCOXjTAA£X30CAQCA 

TACGCT/^TACCGCAGA^ 

C I C I I I CC AGTAATTTACAAAAGAAAATATAQQ C I I ICCIAACI I C I CC C A I C I I Gb l C I 
GO^jlTACADVQCACTAAGTTAAAAGTTAGrAT^ 

AAAGTTTAGCCACAATCraQCI ILICI I AGGCCTTATCTAA 1 1 1 1 1 GCATOCAAATCTAG 
AQCATCGTTTUraQCAOOO^A^ 

[A,T] 

AC/W3CCV&GA^^ IG OGAGGAGCCACr 
CAOGCttCTOaGCr^ 
GATCCTGGAGTOGl IOCCTTATTOVCTAlUL»(JLICtClCI(XI I I I ICATCGLI ICC I 10
COXAACATCCCAAACaCACAGI^^ I 1 I llaATTCTnGGCICI ICICIGUGCAGTTC 
ACroGAQCTCTGG UU I I CU ICU I > I l UU ^TCGCTGGOCTOGCerr^^ 

G0CTAAOXATCCAGGA7GCTGGA 1 1 

TTCATOG L 1 ICU I b CCCCAAiCATXXCA^CACACAj G I Cj I Uj I I I I I G ATTCTTGGCTCT 

TcrtrrccTGCAGrnGAcrccAQCTCR^^ 
ctottrgccttttoqcocat^^ 
[A,G]

TTCAI I G U Uj I CI U I I IC ILAGACATACOtSCTa XIGLI I I L ATCTTn^ACTTCTT 
GQQGCTUTGAGAA CTC1 I CUUCI 1 l UC OCaGGAAGQGA G I ICII^GTAGCAOCCAGG 

CATCGTMTXXCrACTCTAQC^QGCTQO 

GOGAGrrcAcaxAGrrCTAa^^ 



TGW^GAGQCTmnTVCrCA^ 
ATGCATXHTTCTIOOXTr^^ 

1 1 1 ran i vagi i i g i ca tigggccagcag ^ i l i g i re 

TAGGCAGAGbl I C I GAAAGTCTCGAGACTTTATAT 

[G,ri 

GQCrAAACTTTAGGAAC3OT^ I ICI I I IATTCAGG 

TGCAGCTCATTCTTTCXTO 

ATTTCTXXAGOVATO L 1 CIU- I i IV ^GATAGAAATQGGAAAAGTAAGAGCAAAAGGAATCT 

TTTCTTCTCATCraZATACACTAACT 

TATAACAAGTATTATATATTTATGATATATT^ 

TTTATCTXjCrTTTATCrr^ 

TGTTTOVC^CaGGGAGT^ 
AAAATAAAAAATTAGOOQQQCATACTaX^^ 

AGGA7OOTTCAGQ0GAGGA GS1 I (^ GGCroC^GTGAGCrATCATOGCACCACroCATT 

[C,A]

CAQCXTOQQCAACAAAGTXSAGAaTC 

GATTTTX^TAAAAATTACTACTTA AA RSI IAATTTCAAAOXTTTTC 

TAC A1 1 I PG ATTT ATC1 1 rA A ATCTCI I I I 10 1 C I CA ATAAATOGGAAGTATCflGGAAGT 
CTTTTTACTTOCTCAAGGTXIATAG^ 
b l I C I 1 1 CCAACeTGCOVGGCTCTAGG^ 



AAAGre^vangrrTOVAAAA^ 

AATTAGTACTTAAAIU 1 IAGL1GU PS1 IAATTTCAAAGCCI 1 1 1C IACAI I I IGATTT 
ATC1 1 rA A ATCTCI I 1 1 nGTCTCAATAAATCGGAAGXATCAGGAAGTCI 1 1 1 IAC TTCCT 
CAAGGTCATAG^G^dCTTAG^ACCTQOT^ I ICI I ICCAA0C 

IXXXAGQCTXTTAGOOCCAACAATTACT 
[C,A] 

AGTRjCATTCG^QWWjGW 

I [Gl ICI I I I 1 1 ICAAATeXXCAACTGGTA 

I ICI 1 1 I C I 1 1 IC I 1 1 1 1 1 1 1 1 rg AGAGGGAGTCTCAQOXnOGOOCAGGCTGGAGTGC 
ACjTGGQGCX^1OQQQGCACR3CAA0CTI30GQC I GC I GGGTTGACGGCATTCI CCTCOC 
TCAGCOXXJC^VGTAACraQCyOACAG I I I I 1 1 1 

GTCATCGCAGTTCCATTGGAGAAAA 

GQGAGA1 1 I PG1 I C I 1 1 1 G AAATGCrAOTGT G AAGTGCT AGGCTT A 1 1 1 1 I CA AATC00C 
AACTOCTATTCI 1 1 I C I 1 1 I CI MIN I M I G AGAGGGAGTCTCACACPOT0GOOCAGGC 
TOGAGTGGAGTGGGGQGA I C I 'CGGCTO^CPQCAAGCTOGGOC PGG GGGrTGACGGCATr 
CTOTrcCCTCAGCCTOO^ 
C~,T] 

I I I 1 1 1 I II 1 1 M 1 IGIAF 1 1 1 lA GTAC^GAOQGQCrnTCACCtilCi l I A GGCAGGATGAT 
CTTGATClXXITGACirrcGTGATCCJGO 

AAGAOCATAGTTTACnX^TTt^^ 
AAAAGAACTAAGOCAATATCAAACAG 

TOCTCeCTAGOOGACT^ 

GCTGCTGGGGGGGTGGGGCATGG1 ICCAGGACGGCTTXAIU PCTCTOC1 I IGGGCACAT 

TAACCTCTCEGAGCGrcACrATC^ 

ATAAAGTRJITATCAGGAT^VCOVGTCAGATAATOVK^ 
TOGCACTTOCTAAACACTAAATAA^^

[C,T] 

TC^TCOTTTAAAAAI I I i I ILIGLIGACTAGTT^TTUTTOVbl 1 1 I 101 III III VG\ 

ATCOTOOSGCTOVCTOCAAG^^ 

TGCEGAGTAGCTGGGAT^^ II III IIIATTTT 

T/\CTAAAGA03GGGm^^ 

132895 OVGOCTOOGAGGGAAATTAAATATATAAAAAGTCA I I I IU3GGCAGGGGCTCTGGCTOT 
GOOCTAATCCCAGQ^^ 

A O 1 PGTOGTGG \ G OCTOTAGrOOCftGCrACTCAGGAGGCTrAGGQ^GAAGA A I FGG I 
G^CXJTOGGAAGGGGAGCnTCTAATCA^ 

[G,A]

AOVGAGAOVGACTOCAGCFCAAAAACAATAl OSI I 1 1 1 I lA ATCnGTCOCTTAATCGAAA 

TATTGAGAAAATCTCrAGO^ 

AAAATCAAAGTty\ATWQGCAGTX^^ 

GTOCTAAGCATCTrGCATQGTAATTGrGACA 1 1 IGLI PGCTGGGAGGGGAGCTGAiGCTTC 
TOgnTTTXGTXXTASOTAL 1 1 1 (XI 1 1 1 II I ilA I I GGT X ITC^TCTOCATGAJT^TTOCr 

133506 TCTTTOGGGCOCTCA^ 

CXATTATOGCTCOCATAGOCTTTO^GrrA b I I Ua I (iCQCAG^TGTCTCC3CAAGCAATTC 
GA> I P C TOG (Jb GATTX X TCT C r A TACCCATCA CIG I Ib GQGCAOGCATTGCAGCTAAAT 
ATMT^TTCTO^VCnAGrrATA^ 
GTCTO^TTTATATCTAAA^ 

[G,A] 

TOOCTTACTGGTAACATI I I lAATAAGACTAGCTATTAAACAGTAIGI RGAA1 I I GUI* 
AGAGTOOCTCTTATCCITACTTiQCAGCr AAAACACT ATATAAAACAC1TTGGGAGGCTCA 
GQC3QQQW^TO^aTTCAGCTCA^ 
TATOCTACAAAAAATAAAAAATAAAAAAATTAQO 
TTXCAGATGCAlCAGGAGGC^^ 

135473 ACACCAGCACAOVGC^^ 

ATWAGTATTACAGOVGAACAT^^ 

AGAATOGATCTACACAI I 1 1 rATGTCl I GCTACTT AATTGG^G^AATGOTCAGCrATTTA 
TTTAAjC^TUTO^G^TA^TATTTAGTACAGr ATATTCCXrreGAOVTTTTAGAJAACnA 
[G,A] 

OTTmTAOTATAAGTTATAAG^ 

CCTGGAA1 I 111 lACaACiAAATTAG^AAIll IWCTACGGTAAAALiI 1 1 IOVOXAGCAGAA 
CG0GAATATAGAATCC3CATTAATAA I III I ATATACTTAGA 1 1 1 III I ICAGGTACTTAT 
AAAGCTTAGTCTATTTCCAAAATCT^ 

TTAAAAI 1 1 I III IAAI I 1 1 (sAATTATGATTjTTAAATAb I 1 1 1 CATCTTATTTCATAAAA 

136201 GACAAAAGGGGATOOACAAAAAATATAAAAATT^ 
TASTCGCAGCTAO^^ 
CTOCATCAGOCATX^TTA^ 

TOWWWWVVAAACGTCrAC ll 1 IIIAIII » I IIA AGTAAI1 1U IICTATGAAAATC 

CAGAGGAOGTACTTTATGA^^ 

[A,G]

GCnTTATATTPCAG A l lll l I I ICT AAA AAAAAAAA A M GTC ^ 

AATCAGTOCI G r G GCTGATCrAGAOOOOCTACAGAGQCACA l I nSl II I Uj LI IGb l CA C 

A G l l I I I l O OCnOOVQG GTTA l l I CX a I ( (^ CA AACrGQGCTXXTATG ATGTOGCAnCT 

ACXTTOQCTCG^GTCCCTXXCOTATTG^ 

ATACJTAAGAAQCAAAGAGAGATCACTA 

137080 TGGTATATGMAAC A IGPC1 IAAAGTCAGAX ATI I 1 G WV\TTTGAAGCTATCTOVGTAA 

GGCTATACreAAAGGGTTQCAGl I PGG1 I A GGAGTOGAAAT AI nTCTI PS1 I A ATCATC 
TCTTCA GI II I G GTA O-IIIG I 1 1 I ALI I I II IA TC UU I Ib GAAA ll 1 1 1 I G CAAAAT 
TTAAGCOXSQGnOAGATAATAOCAGATCTACCrA 
[A,C] 

1 1 ILUGUbl lAAATAAGCTATC^TATTAAG^TATTX^ 

AOCTTCTOGGAGCAGCnTiCrAACATA G 1 1 1 1 CAQV^XZAATATXTTOAGATGAAAAGG^TC 
OCCTlOCAGGAQGArnOCnGAGllGI I C^GAAATOVTTTAAGriTACAGOb I III IllCnT 
QUbl I IICAGTCOCTTTTACT^^ I PCTCAA1 I I I I I IA 

AACTCHAACAATAAAGTAAAATAC^^ 
138022 GAAAGCTCTOWGTTTGAGGAG^

TTAGAAAAACAAGAACAU I 1 (JL I C I ATOOCTAAAATTATTCTTTACT^ 

TAAAAGTrAGTQCrrm^TTCTT^ 

TOCACCTTACJCACTrC^^ 

TCTQGCTG b l I U I I 1 OCAGGAAAAAA/NATXTACATTTTCAQIATACr AOGCTTTCATCT 
[T,C]

CTTATTATAAGCTTTTAGA^ 
CTTTCTCATrTTTCAGM^ 

ATTATAAATGTTAA 1 1 1 1 GQCTX3QCnAAGGTCGCTTTX3QCCTATMT^ I I ICG 

QVGGCTCAGGAGGGAAGATTCC^ 

GAAGATC C 1 G I CIL I ACT AAAMTAAAAAAAAAAATTAGGCCAAGCATAGroGCTCATQC 



138543 GfrCKAGCCTGQCXAACATAGCAA^ 
CCAAGCATAGraXTOVTOC^ 
ACTTCAGOGCAGAAGnCAAGAC^ 
AAATTTAAAAATTAGCXAGGCAra^ 
GAGOTAGTQGAAGG^TTTXnrTCAGC^ 

[A,T]

ACTCTAGOCrcrrcAACTCAQCAGAACSOCTC 
AAATCTTAAA1 1 I I G I 1 1 l A AGTTTTAGQVCAG^^ 
TTTTAGVW\ACTAATTCAAAA^^ 
TCrrTTOW^TTTCQCK^TTTTAQGAAAAA 

CTTOGrrAGrrrocATATT^^ 

138681 AAGAOD^GCTTOQOTAAOXTAATCAG^CCCT 
QQCATX^TCGrnQOOCAOCTCT^^ 
CTTCQGCCTAAGAGfrTCC^^ 

TCAGCAGMGOOCTC^ M IGM 

TTAAGTTTTAGCACA^VCRXXDCTC^ 

[C,T,G,A]

AAAMT^VAAACrrrAOTZTCTAAAGAOClCTACAG I G I 1 1 1 Id I I ULAAAATTTOQCT 
G^TTTTAOi^AAAAAGTT^TCATCT 

TAAAAQVGCAGTGACAACT ATATATAACTTAC^TCrCAGCATA I G I G I I IGIATATTAAA 

CTTCAG^TATCTAGTTTT^^ 

AAlCmTTATTTnAATAGAC^ 
SEQUENCE LISTING



<110> APPLERA CORPORATION et al . 



<120> ISOLATED HUMAN TRANSPORTER PROTEINS, 

NUCLEIC ACID MOLECULES ENCODING HUMAN TRANSPORTER PROTEINS, 
AND USES THEREOF 



<130> CL001362-PCT 



<160> 4 



<170> FastSEQ for Windows Version 4.0 



<210> 1 

<211> 1551 

<212> DNA 

<213> Homo Sapien 



<400> 1 

atggtgctct cccaggagga gccggactcc gcgcggggca cgagcgaggc gcagccgctc 60 

ggccccgcgc ccacgggggc cgctccgccg cccggcccgg gaccctcgga cagccccgag 120 

gcggctgtcg agaaggtgga ggtggagctg gcggggccgg cgaccgcgga gccccatgag 180 

ccccccgaac cccccgaggg cggctggggc tggctggtga tgctggcggc catgtggtgc 240

aacgggtcgg tgttcggcat ccagaacgct tgcggggtgc tcttcgtgtc catgctggaa 300 

accttcggct ccaaagacga tgacaagatg gtctttaaga cagcagcatg ggtaggttct 360 

ctctccatgg ggatgatttt cttttgctgc ccaatagtca gcgtcttcac agacctattt 420 

ggttgtcgga aaacagctgt cgtgggtgct gctgttggat ttgttgggct catgtccagt 480 

tcttttgtaa gttccatcga gcctctgtac cttacctatg gaatcatatt tgcctgcggc 540 

tgctcctttg cataccagcc ttcattggtc attttgggac actatttcaa gaagcgcctt 600 

ggactggtga atggcattgt cactgctggc agcagtgtct tcacaatcct gctgcctttg 660 

ctcttaaggg ttctgattga cagcgtgggc ctcttttaca cattgagggt gctctgcatc 720 

ttcatgtttg ttctctttct ggctggcttt acttaccgac ctcttgctac cagtaccaaa 780 

gataaagaga gtggaggtag cggatcctcc ctcttttcca ggaaaaagtt cagtcctcca 840 

aaaaaaattt tcaattttgc catcttcaag gtgacagctt atgcagtgtg ggcagttgga 900 

ataccacttg cactttttgg atactttgtg ccttatgttc acttgatgaa acatgtaaat 960 

gaaagatttc aagatgaaaa aaataaagag gttgttctca tgtgcattgg cgtcacttca 1020 

ggagttggac gactgctctt tggccggatt gcagattatg tgcctggtgt gaagaaggtt 1080 

tatctacagg tactctcctt tttcttcatt ggtctgatgt ccatgatgat tcctctgtgt 1140 

agcatctttg gggccctcat tgctgtgtgc ctcatcatgg gtctcttcga tggatgcttc 1200 

atttccatta tggctcccat agcctttgag ttagttggtg cccaggatgt ctcccaagca 1260 

attggatttc tgctcggatt catgtctata cccatgactg ttggcccacc cattgcaggg 1320 

ttacttcgtg acaaactggg ctcctatgat gtggcattct acctcgctgg agtccctccc 1380 

cttattggag gtgctgtgct ttgttttatc ccgtggatcc atagtaagaa gcaaagagag 1440 

atcagtaaaa ccactggaaa agaaaagatg gagaaaatgt tggaaaacca gaactctctg 1500 

ctgtcaagtt catctggaat gttcaagaaa gaatctgact ctattattta a 1551 



<210> 2 

<211> 516 

<212> PRT 

<213> Homo Sapien 



<400> 2 

Met Val Leu Ser Gln Glu Glu Pro Asp Ser Ala Arg Gly Thr Ser Glu
1               5                   10                  15

Ala Gln Pro Leu Gly Pro Ala Pro Thr Gly Ala Ala Pro Pro Pro Gly
            20                  25                  30

Pro Gly Pro Ser Asp Ser Pro Glu Ala Ala Val Glu Lys Val Glu Val
        35                  40                  45

Glu Leu Ala Gly Pro Ala Thr Ala Glu Pro His Glu Pro Pro Glu Pro
    50                  55                  60

Pro Glu Gly Gly Trp Gly Trp Leu Val Met Leu Ala Ala Met Trp Cys
65                  70                  75                  80

Asn Gly Ser Val Phe Gly Ile Gln Asn Ala Cys Gly Val Leu Phe Val
                85                  90                  95

Ser Met Leu Glu Thr Phe Gly Ser Lys Asp Asp Asp Lys Met Val Phe
            100                 105                 110

Lys Thr Ala Ala Trp Val Gly Ser Leu Ser Met Gly Met Ile Phe Phe
        115                 120                 125

Cys Cys Pro Ile Val Ser Val Phe Thr Asp Leu Phe Gly Cys Arg Lys
    130                 135                 140

Thr Ala Val Val Gly Ala Ala Val Gly Phe Val Gly Leu Met Ser Ser
145                 150                 155                 160

Ser Phe Val Ser Ser Ile Glu Pro Leu Tyr Leu Thr Tyr Gly Ile Ile
                165                 170                 175

Phe Ala Cys Gly Cys Ser Phe Ala Tyr Gln Pro Ser Leu Val Ile Leu
            180                 185                 190

Gly His Tyr Phe Lys Lys Arg Leu Gly Leu Val Asn Gly Ile Val Thr
        195                 200                 205

Ala Gly Ser Ser Val Phe Thr Ile Leu Leu Pro Leu Leu Leu Arg Val
    210                 215                 220

Leu Ile Asp Ser Val Gly Leu Phe Tyr Thr Leu Arg Val Leu Cys Ile
225                 230                 235                 240

Phe Met Phe Val Leu Phe Leu Ala Gly Phe Thr Tyr Arg Pro Leu Ala
                245                 250                 255

Thr Ser Thr Lys Asp Lys Glu Ser Gly Gly Ser Gly Ser Ser Leu Phe
            260                 265                 270

Ser Arg Lys Lys Phe Ser Pro Pro Lys Lys Ile Phe Asn Phe Ala Ile
        275                 280                 285

Phe Lys Val Thr Ala Tyr Ala Val Trp Ala Val Gly Ile Pro Leu Ala
    290                 295                 300

Leu Phe Gly Tyr Phe Val Pro Tyr Val His Leu Met Lys His Val Asn
305                 310                 315                 320

Glu Arg Phe Gln Asp Glu Lys Asn Lys Glu Val Val Leu Met Cys Ile
                325                 330                 335

Gly Val Thr Ser Gly Val Gly Arg Leu Leu Phe Gly Arg Ile Ala Asp
            340                 345                 350

Tyr Val Pro Gly Val Lys Lys Val Tyr Leu Gln Val Leu Ser Phe Phe
        355                 360                 365

Phe Ile Gly Leu Met Ser Met Met Ile Pro Leu Cys Ser Ile Phe Gly
    370                 375                 380

Ala Leu Ile Ala Val Cys Leu Ile Met Gly Leu Phe Asp Gly Cys Phe
385                 390                 395                 400

Ile Ser Ile Met Ala Pro Ile Ala Phe Glu Leu Val Gly Ala Gln Asp
                405                 410                 415

Val Ser Gln Ala Ile Gly Phe Leu Leu Gly Phe Met Ser Ile Pro Met
            420                 425                 430

Thr Val Gly Pro Pro Ile Ala Gly Leu Leu Arg Asp Lys Leu Gly Ser
        435                 440                 445

Tyr Asp Val Ala Phe Tyr Leu Ala Gly Val Pro Pro Leu Ile Gly Gly
    450                 455                 460

Ala Val Leu Cys Phe Ile Pro Trp Ile His Ser Lys Lys Gln Arg Glu
465                 470                 475                 480

Ile Ser Lys Thr Thr Gly Lys Glu Lys Met Glu Lys Met Leu Glu Asn
                485                 490                 495

Gln Asn Ser Leu Leu Ser Ser Ser Ser Gly Met Phe Lys Lys Glu Ser
            500                 505                 510

Asp Ser Ile Ile
        515

<210> 3

<211> 139573 

<212> DNA 

<213> Homo Sapien 

<400> 3 

aatgggtatt ttgtagactg tccttgattg ggatttgtct aatgtttttc tgatggttag 60 

ccagggatta tgggttttgg caggaagacc acagaggtaa agtaccattt tcatcacatc 120

atatcgggga tacattatca tctagttgag gtactgtgtg ccattttttg caccctaaag 180 

ttatttcttc cccccactcc ccctttccat cctatactct ttggaagaaa gttactacgc 240 

atacccacac ttaaagagta aaccattgta cttcacctcc atgagggagg gagtatgttc 300 

ataaagtatt tacatttcct gcaggagaga tttgtctatt ctctcctcat tatttattta 360 

atcatttact tacatcagta ctgactcgtg gataattctt acatatgtgt ttgtttgtgt 420 

gcatgcaaat atataatcga tgtgctttct ttgcccaata atatgttgtg gacaactttc 480 

aaagtcaata aatacagatg accttcagaa cttttagagg ttttaaagta agtatctaat 540 

cagtcttcta ccaatgtaca ttatacttcc aaattttcct tatttccaac aatactgggg 600 

tatcatcttc atacatacat ttttgtgcac ttatgtgcct attcctttgt ttactatttt 660 

accctcattt ctaaggcaga ttacacttga gctatgttcc ccattccaca accaagcacg 720 

gcttgctttc cttagtttat gctattttct ctacctggaa tgctcttttc ctgtcttgac 780 

ccactgaagt cgtatgtatg aatcagggct tcggccaaag gctgtttgct gtagaggctc 840

ttctacagtg tttggagaga atttaaggga ctatctcatc tctctctttt gtacatgtat 900

tataacacat catctgagcc tcctagtctc tcccaggact cttttttcct accagtttat 960 

caactgataa gaggcagaaa cgagatcaat cgcactcatc tgtgtactct atcacagtgg 1020 

tgggcacatc aagtaactag catattttga ctttgattga agtgaagaat acgaataaca 1080 

gaaattaaga agcatcctca atattgcata gcaggttact cttcttttct tttacatagg 1140

atggcactcc atgcttcagg gagacagagg agttgaatac aggttttagt ttttgtttaa 1200 

agtgaaaacg actctgatgt agttgaaaag taatgctttc tagctgtctg ttaaaaatgt 1260 

ttgttggttg aagacttcgg aattgcagtc cagtgaggac tgaaaataag catctttggt 1320 

gtgccaaata ttcataagga aattgtatac gaatgcaaga gaatggaact gaagtaataa 1380 

aataagggct ctgatccttc agatgactta tttaagaagc caggtggcat aacgaatctt 1440 

acatattata attagtactg agaggtgaat gccaaaacat aaaacaaaca caatcgagac 1500 

aatgttagtg tgactgtgac gctgtgtcgg tgagttgagg ctaacgatcc agtgtggctc 1560 

tcctgaaggc ccaccgcgcc cgcacctagg agacgcgccc cttctgctca tgctttgagg 1620

cggggtgacc cacacatctg tgcccctctc tgagcaggag gaggccccgt cgcagacgcg 1680 

cgcgcagaca gcgtctgccg cgggcacctg gggccgcgcg ccgcggggcg ccccgcctcc 1740 

gctctccgag gcccaatcat ctggaggctg tgggggcacg tcccgctccc ggccacgccc 1800 

ccagccggcg gggcgggggg tgcttttaag aaccggcggc tggcagtggg ctcagtcggg 1860 

ggtgcggggc tgtgacctag aggcttcagt gtcgatcccc gaggtgttcg cgcgcgccag 1920 

ctgtcctcgc ggccgcctgc gcgctggccg cctgcgcgct gccagcccgc ccgcccgcca 1980 

ggggctccgc cgccctcgcc tcggcctcgt tagcccgcca ggagccccgc agctcctccg 2040 

ggagcccgct ggtaactcgc gtccctcgcg cttctccggc gcctgagggg cccgcctcgg 2100 

gccatggtgc tctcccagga ggagccggac tccgcgcggg gcacgagcga ggcgcagccg 2160 

ctcggccccg cgcccacggg ggccgctccg ccgcccggcc cgggaccctc ggacagcccc 2220

gaggcggctg tcgagaaggt ggaggtggag ctggcggggc cggcgaccgc ggagccccat 2280 

gagccccccg aaccccccga gggcggctgg ggctggctgg tgatgctggc ggccatgtgg 2340

tgcaacgggt cggtgttcgg catccagaac gcttgcgggg tgctcttcgt gtccatgctg 2400

gaaaccttcg gctccaaaga cgatgacaag atggtcttta agacaggtga ggcgcggcgc 2460

ccgccgaggc cagcctgggc gacccgcgtg gggcccccga gcgcatcccg cgtgtgggct 2520 

gtgtctgcct ccgagtgtgc atgtcggtgg gtccctgtgc cagagggtgc gagcaggggg 2580 

gtctttcgag ttgcagacag agcctgccgc ttctggggcc tcggggtgcc cgtctttata 2640 

tggaatccag ctgcagagct gtgtgtttgc aagcaggtcg cagaacttac ttgccgagat 2700 

cgtcctcctt tcccctcagc agagcagacg ctaacagtcc acaggagccc ttccttttat 2760 

tgtttgaaaa caaacagaac cccagaacct tcaaccccag tcatcgccct gtcatttttg 2820

tggtctcttt cgtgactatg ccagttatgt agttcttcac ctgcctccct gggccgcaga 2880 

ggggtgtgcg tatgttggcg gggcgggggg tggagtttgg aggaatgaaa gagatttgta 2940

cgaaggtcac tggagttcca aagggggccc tgcaagagtc acggttccgt gcgttcccgt 3000 

ccccccgcct ttttttgccc tctgggttaa atgtagaaaa cacgggaggc agccggatta 3060 

gggactagga tgaggaaggt gaagggttgc ttcttccctc ttcctgttgt gttttttgac 3120 

attttttttt aaccatatag taaattagat acaaaaggtg cagattcagc gttttctccc 3180

tgtagagcat tattatgact ttttggctgg ttaggcaaaa aacaaatcta agaccttctg 3240

catgacactt taacataaat tctttcactt tatcctgcaa ggtgagcgcg gtcaacccca 3300 

tttgggtgag aaaactgtag ctcagtgaaa gtgtcttggt gggtagtaga atggcaataa 3360 

aacacatatc aactgacttc aagggctaag tgatttccat tactaaatca acctccctcc 3420 

ccatcatttg gggtaacttt atatgattaa tagtcttttt ttttaacctt gattttctat 3480

tatttttaga gtgaatattt cttaggtctt tagtatgcat atgaggaatg ggcaagactg 3540 

taataaattc tgagacaaag gtaatgctgg gttatgctga gagttttaaa acctgacata 3600 

aatactatta aactatttgt atcattctgc aacttacttt tcttccattc cgcatcatgt 3660 

ttgtgactta tccacataat acctcagtgt gaactgataa ctcaaattct ttccatttta 3720

acttaggtgg tttgcattgt ttgactatat tatactctat gcattctccc tctgatgggc 3780 

atttagattg cttccaaact cattctaaac aatgctgcaa tgaatattct tgtacactct 3840

cgttatgcat gtgaatacgg taccatttta acctggaatt tctgttcttt aaatagctat 3900 

tgaaactgct gtggtatgcg ggtcaatggg ctaggtacaa aaagtgttaa aaatgtagta 3960 

acatatcctt accatttaag ggaagtaatc attgtaaagt ttagcagggg agatatgcat 4020 

atataatagc aaacaaaaat agtttgttgt cttttctata tgagtattgg gtgtcagaga 4080 

gaaaagcccc aaaagaaggc agaattgaca gagttaacat ttaaagacta gttccaacat 4140 

ttaccatatt cctgcctggg attacagatt ttttaatgca gtcaagataa cagcagtctt 4200 

ttgtttatca ttgtttttgc aaattcagtt aagtagatcc tttggtgtct gtgggtgggt 4260

tttttttttt tttttttttt tttttttttt tttttttgag agagagagag agcaattgcc 4320 

agagagacca tagctttgcc agggatgaga attttgcagt gtcaaagtct ctacctacta 4380 

cttgtcccca aagttctaat tggctacaca atatcccaat actgggtagc tgagagtgag 4440 

ggaaggaacc tggtttttct tttgcactct gtggaacttt gtgttttcca ttttgatgaa 4500 

tatctttttt ctttttactc agttcagtct ttgacaactt tttcagtcat gtttgtgtat 4560 

gtgtgggtat atatcatata aacagttgca caggtgtgct agttaaatgt gtgaagatct 4620 

ttgtgtttct ctgcctgact gctgtatatc tatttatggt tgtgccattg cacaagggtg 4680 

cccaactcag gggtaagtgg ggactgaaaa ccagcctggg ctctgggtgc cctttgtctg 4740 

attcctacag aagggcccta tgcacttgta atggccctgt ataacacagc atctagattg 4800 

aacaatggcc attacttggg tgctaggtaa tacatatatg actgatagat gttaagggct 4860 

gaggaagaaa acagatttaa acttagtgct gaaaaaatgg ttacaagata gtctttaagc 4920 

cagttattgt tgagatctct ctctcttccc ctgtccccta cccctttctc tttccttcag 4980

tgcacacaca cacacacaaa ggtgtttcat gaagtccctc atctaccaca gtcactgtta 5040 

tttgagaata tctgctttga agtttgattg gtcacacttt ttcactttga tattcgaatg 5100 

ctgagtcgtc tgtgatcaag catatgcaag cttcaaatac atgccaaaaa atatctggaa 5160 

tttgtttaag ccttttattt ttcaaagttt tggtctattt tctattaccg tactcatgat 5220 

ggataatcct ggtgttagag tacagctagt tctgtctcct tgtttccatt acttctttat 5280

agcaagtgac tagcctaagg atatacaggg aggtggtggt ggaatggaat ctaggtctcc 5340 

aaatgatggt gcgcatttct tgagtacttt cctgtggcta agcactttag atgcgttcct 5400

atttaaacct taccacgatt ctctgataga ctttgttaat atctttcttt tcagatatgg 5460

gaactcaggc ttacagagtt taagtaagaa gtggagccag aattcaaccc caggcttatc 5520 

tgactctaag agctgggatt tttattttaa ttatttattt atttaaaata tggaatgctt 5580 

catgaatttg catgtcatcc ttgttcaggg gtcacgctaa tcttctctgt gtcattccaa 5640 

ttttagtaga ttgttgttca aagtgctgct gaagcaagca ccaggagctg ggttttaatc 5700 

attcatcata ttgcattgac tagataacat tctgcaaata cgatgttttt tatgttgttg 5760 

attaatttaa gtgttagtga ttggttgagt gctctaccat gcattctggg attagaaaga 5820 

agggtccctg tttcttggtc ctactttgtg gtgaataaac aattgcaaat tattaatgtc 5880 

tcaaactata tttctgaagt gtagagagac ttccatagaa gaacaagata cttccatatg 5940 

ccgttcaagc aaaagtctgg ggtttccttt gaagaacttt tagattgatc cacagcagga 6000 

caatgtttct aggcagaact gaggaggagc ctttcttagg ctcacttctc ttcagggctc 6060 

tgttaactct tcccacgcaa tggataatct acccaaaatt tctcaggaaa gggcctgaag 6120 

aagttcattc acactaaggt gtaagtgagt ttacacatct tactgttaat tctctttata 6180 

caaatgttta ccaagttatc taacacgctt tgttttgggc tctgtcctgg ggactggaga 6240 

taatgactga gagagaaaat gtcagctgtt tcaaagtagc ttaggatctg ttgtgggata 6300 

caaattaata acagaccaga agtaatagaa tatttccctg aaggattttc aatataacag 6360 

gactcagttt tactataaaa ggctgaaatt ctaaggtcat ttcaacaggt ggtggggttg 6420 

ggggtgggga aggcatttga cgcctctttc tctatggtta taaatctcac ttggtgaaat 6480

taagactttg gaaaggggaa gtaagccaac tccaagttgg gcagtagaac caatgaaaaa 6540

tgctgacggc atcacagtcc cattatggtg cccagctgcc aatgacatgg cactcagagg 6600 

agtgtctcac acatactgct ctgtctgagg gagcaagcta agcttgagtt gtctcttttt 6660 

ttgttgtttt tttttttttt ttttttttga gacagattct cactctgtcg cccaggctgg 6720 

agtgcagtgg caccatctcg gctcactgca accactgcct cccggatgca agcaattctg 6780 
cctcagcctt

atttttagta
gtgcatgata

ctcttccttc
ttttatttta
tttgagtagc
caccttcatt

gttgtacatc
acagtattag gattgatatc agtatttact tttattaagt tgaacttaag agcagctttt 10440

tggctggaaa aaagttgtac ttatgtcaaa gttgtcctga aagtagaatc ctactcctgt 10500 

ccccagcctg aaactattta ctacatattt acttgcatgt tctttagaat attctctcaa 10560 

tagtgtctcc tactcaagtc atcagaaatg ctgtgatgtc atttttggaa aagaattcca 10620 

gagttatcac cgcctagcta gaaatctggt cttatattca aattaaacaa gcaaacctta 10680 

acaaaaacaa gctaaaccct aaacataaca tgaacagtca gctcaccaca gttctgagca 10740 

cctgccttgg cctggtgcca cccacgcagg gactgtggat gtttttattg gcagagattc 10800 

agaacaggaa actccagcac acctgggaac tgcgcagacc caccacataa gacagatagc 10860 

ctatcagtgg ctggaggaat ggaggaaagc agtgctttca aatgtacatg ccaaatgtgt 10920 

atgatcatac ctctttgtta aagtgccttc tttaacagca aaagtaattc ctcaccttgc 10980 

atataggaac taaaaaaaag tcgatgaaga aatggcttgc cttattttca agtaagaagt 11040 

cttttttcat ttcactaatt tttaattatg ggcataagta tgaaatacag attagaaata 11100 

ctgaatgtgg accaaagcaa tgtttccttt gtggaccaaa gcagtgaatc ttttttcttt 11160 

ctttctttct tttctttttt ttgttaagag acagggtctt gctctcttgc tcagactgga 11220 

gtgcagtcag tgatgtgatg gctcaccata acctcaacct cttgggctca agggatcctc 11280 

ctgccccaac ctcctgaaca gctgggatta caggcacata ccaccacacc tggctaattt 11340 

ttaaaaattt ttttgtgggg gagggtctct ctatattgcc caagctggtt tcaaatgcct 11400 

gagctcaagt gatcctccaa cctcaggctc ccagagtgtt gggattacag gtttgagcca 11460 

ctgtgcctgg ccccaaaaca gtgtttgtaa ctccctttgt cccctcttga taaacataaa 11520 

agtctcatgg tacttcagaa ttttgttgca cacatccaag tgtagtttgc ctttccttgt 11580 

gagtggcaga agacaatgtc atactctgta tttatccatc agccaaaatt ttgtcaagct 11640 

ttacttttat tttttaaatt tttatttgta tttttttttg gagacagagt cttgctctgt 11700 

cgcccaggct ggagtgcagt ggcgtgatct cggctcactg caacctccga tgcctgggtt 11760 

caagcgattt tcgtgtccca gcctccagag cagctgggac tacaggcacg caccaccatg 11820 

cctggctaat ttttttgtat ttttagtaga ggtagggttt cgccatgttg gtcaggctgg 11880 

tctcgaactc ctaatctcag gtgacccacc cgcctcagtc tcccaaagtg ctgggattac 11940 

aggcatgagc caccatgccc ggccaagctt tacttttagt tatgttgtgg atataattag 12000 

aattattttc ttgttcttta aatacagaat tataatattg aatgtgtgct ttaaaaaaat 12060 

tagtaaatgt acccctctag aacttctgat ctatgcctct tagtgagtga ggggagctgt 12120 

gccgctttcc tcctccagtg cctccactta agaatcactc actaaggagt tttaaattca 12180 

attaaaggta tcctttagat agttaagtct aaatgaaagg tcaatgatta aattaatgga 12240 

taaaagtcca ttgcacctac ggaaaggtgc attggtctac agttcagcta gtacctattt 12300

tttgctaaag attaaattgc agctgggcgg ggtggctcat acctgtaatc ccagcacttt 12360 

gaaaacctga tggaggccgg gtgccgtggc tcatgcctgt aaatcttagc actttgggag 12420 

gccgaggtgg gcggatcact tgaggtccgg agttagagac cagcctgacc aacatggaga 12480 

aaccctgtct ctactaaaaa tactaaatta gccgggcatg gtggcgcatg cctgtaatct 12540 

cagctactgg ggaagctgag gtaggagaat cgcttgaacc cgcgagttgg aggttgcaat 12600 

gagccaagat catgccattg tactccagcc tgggcaacaa gagtgaaact cagtctcaaa 12660 

aaaaaagaaa accctaggtg gaaggatcgc ttgagtccag gagttcaaga acagcctggg 12720 

caacatagtg agaccccatc tctactttta attaaaataa aattaacatt aaaaagtgta 12780 

gtaaaatttt taaaaagagt aaattgtatc agcagtgttc tgcctgtgtt cagaaagcca 12840 

aaatttatat ttgtgtttca tttagattgg atccaagacc aattttgaga tgtgttttaa 12900 

tattacaaaa atagaaaact acctgttcat taaatggtga tatttattga cttttctgtg 12960 

tagataagta ttaatgccaa gtcaagtagg ttagtctgga gactgttttt attaataaaa 13020 

accttttcat cttaattatc ctttttatta gttttgactt tatgttgcaa cttcaaagca 13080 

gcatctcaaa gggttcacat ttgtcagtgg ttgtttaaac aacgacattc cacaaagtta 13140 

attgatgttt tagtgtgaat ggggcaggaa gttgtcattg ttgtcctgta gtaacgtagc 13200 

actcaaactt tgacaggagg accccagcca tgtagttatt gactgtaagt acacttccct 13260

taagagtaag ttgtttttta ggaataaatg ataaatgtat ggagttttag ttgtaggtgt 13320 

tatgttttgt ctccttcttc tcagagaaaa atgttactgt ggaataggct taaactgaaa 13380 

aaaggatctg atttttaata agatcagatt ctttggccat attttgatat tggttcaaaa 13440 

caaatgttta atatcagatg cacaatgtta agagctctat aatgtagtgg taacacctga 13500 

gcctcagcca gtgactacac attaagttct tgttcatttt ttgcaggggc aggggagctg 13560 

gtgaggcata aagaagggtc agaggagata atagacttat atttgtattt ttatgcataa 13620 

ttatatagga aacattaaat ccagagttga taaataattt gtatgtatgt atttatttat 13680 

ttttgagact gggtcctgct ctgctgccca ggctgaagcg taggggtatg aacacagctc 13740 

actgcagcct tgacctgggc tcaagcgatt ttcttgcctc agcctcccga gtagctggga 13800 

ccacaggcat gtgccactac acctggctaa tttttaaagt ttttttgtag agatggggtc 13860 

tcaccacgtt gcccaggttg gtcttgaact cctgggctca agcaatcctc ctgccttggc 13920 

ctcccagagt gttgggatta tagatgtgag ccaccatgtt cagctgataa ataatttcta 13980 

atctaaaaat cctattttgt atggagaggg gagggcaaat aggctatttt ttccacattt 14040

tgttgctggc cagaatctca gagggttttt acctgcatta aaaatgatta aggctgggca 14100

gaatggctca tgcctgtaat cccatcactt tgggaggctg aggcagatga attgcttgag 14160

cccagaagtt ctagagcagc ctgggcaaca tggtgaaacc ccatctctac gaaaaatgca 14220

aaaattagct ggacgtggtg gcaggcacct gtagtcccag ctactcaaaa ggctgaggtg 14280

ggaggatcac ctagctctag aggtcaaggg tgcaatgagc caagattgct ccactgcccg 14340

ccagcctggg cagcagagca agaccctgtc tcaaaaaaaa aaaaaaaaaa ggttattttt 14400

tttcagcata gacaaagcag gggaaggaaa agattatgtt tcaaatgttc atttaaacta 14460

ctacatttaa agtaatactt cctaatgatt taaactttag attagtctat ttatgggtca 14520

cctggaagat tctttataaa acatgagagt ttattacttc tttcaataca cgggtgtctg 14580

taaatgatgc tcaatagatc tgaagcctga actttctgaa gaaatgttgt gaaattatct 14640

atggatgtct acttgaattt aaataaaaat aaattgtaat acatttggtt ttattggttt 14700

tgaatttgta attatttggg gatttggatt tggtttatct acagttgtct ttttttttga 14760

ggtggagtct cgctttgtca ccaggctgga gtgcagtggt gcgatctcgg ctcactgcaa 14820

tctctgcctc ccgtgtacaa gcgattctcc tgcttcagct tcctgagtag ctaggattac 14880

aggcatgcac ctccatcccc agctaatttt tgtattttta gtagagatgg ggtttcacca 14940

tgttagccag gatgatctcg atatcttgac ttcgtgatcc gcctcggcct accaaagtgc 15000

tgtgattaca ggtgtgagcc accgtgcccg gcctacagtt gtcttttttt actcactccc 15060

acagatgaat cattataagg aggttagctt tccttaaaga atacactctc cttaagcggt 15120

tttatcaaca aagccaggga atgccaaact ttaacacttt tacctaaatt tataactgat 15180

gcatgtatgc atatatacat acatacatac atgtatatgt tgttatatat gtatgggtat 15240

catcagtata gtctatcagt atagtaattg tttatctgaa acttggggtt ctctctcgct 15300

ctttctctcc ctctctctct ctctctctct ctctctctct ctctatatat atatatatat 15360

atatatataa atatatgtat atatattttt tctttttttg agacaggatc tcattctgtc 15420

acccaggctg gagtgcagtg gtgggatcat ggctcactgc agcctcgacc tcctgggctc 15480

aagtgatcct cccacctcaa cctcccaagt agttaggact acaggggcat gccgctacac 15540

gtaactaatt tttgtatttt tttgtagaga cagggttttg ccttgttgcc caggctggtc 15600

ctgaaatcct tggctcaagc aatctgtcca cctcagcctc tgaaagtgct ggcattacag 15660

gtctgagcca ctgcgcccag cctagatttt tttgaattgt aaaaaagtaa cctgctccct 15720

actgaagtaa atagagttaa aaaaagtaat ctggtacaga cacctgtatt ttctgacacc 15780

cctagaagag tcccaggtac cctataatca aatacattaa catttctgca gcaaaatgta 15840

tggataagtg agttaaatag agaccatgag tagcttcagg tcagttcaga tcaagttttg 15900

cttctaatta aatgttgata ttctcttaca aaaactttgg gtttgggttt tcagatttgc 15960

aaataaataa ttataaatta ttattttttt tgagacagag tcttgctgtg ttgctcaggc 16020

tggagtgcca tggcacgatc acggctcact gcaacctcaa cctcaggctg aagccatcct 16080

cccacttcag cctcccaagt agctgggact acaggagtgt gccaacatgt ccagctcatt 16140

tttgtattct tagtagagac agggtttcgc cgtgttggcc aggctggtct caaactcctg 16200

gtctcaagtg atccgcctgc cttggccttc caaagtgttg agattatagg tgtcagccac 16260

tgggcctggc agaattatac atttatatgt caatatttgc ttttgttttc tgtttttcag 16320

taaacgtttt ttaaggtaca tttttctgta tctcataagg cacctgctta attgtttcag 16380

taagtgtgat gttctaccat attggtctac cctaggttac tcaaccaggc ctcctttgtt 16440

tagtgagtag caggcagtgt tgtacaacat atgtagcata tctgtatatg tcgtcgaaca 16500

aattgttttt ttcccctctc ttggattgct tccttgggtg tacgtccaga agtgagatta 16560

ctggttcaaa gggtatgaac aactttataa cacctgttac acattgccag attattcttt 16620

agaaaacttg aatcagttta ttgtgccacc agtgatgtgc tggcttcctg aaaaccctac 16680

caatgtttgg ttttattttt attagtattt gctaatttga taagtactaa tgatattttt 16740

taaaagtagt ttaaaatcat atttcagtgc ttataagtct gtgttcccag tttttgagcc 16800

ctttagaagc tgcaaatgac ctggcaatta tatataatat ttgaaaatac aagaggacat 16860

atgccagtga atatattaga gtaaaacttc attcccatag gtaatgaagg aatgcttgag 16920

attatcttag gccttagatt ctcacctgac acatcttggc aggtagacca tgtccttgtt 16980

tcctctgctg tcttagccca ggtgttgatc aaggtctgtc ttagggcggg ggataggaat 17040

ggaaataaac catgtagaga cttgggcatg aggactttgt gattcttcca ggtgacatct 17100

catccttcag aggatcaagt ctgcaagagt agccatatct taatctcttt cagtgctatc 17160

accttgcatc aacctctgga ctcgagctaa ttccgttgaa aatattttat taattaattt 17220

tggggtatgt taaaaatttt gtttgcatgt atttatttag ttatttttat gagacagggt 17280

ctcgctctgt cacccatgct ggagtacata cggttgcacg ctcatggctc actgcagcct 17340

tgacttccca ggctcaagtg atcctcccac ctcagcctcc tgagtagctg ggactacaag 17400

tgcatgtcac cacatttggc taattttcat attttttgta gagacggggt ttcgccacat 17460

tgcccaggct ggtctcaaac tcctggactc aagggatctg cctgcctcag cctcccgaag 17520

tgctgagatt ataggtgtga accaccgggc ccggctcccc atttaatttt gttgttgttg 17580

ttttttagat ggagtttcac tcttgtcgcc caagctggaa tgcaatggca cgatctcggc 17640

ttacggccac ttccacctcc tgggttcaag cgattctcct gtctcagcct cccgagtagc 17700

tgggattaca gacacacgcc actatacctg gctaattttt gtatttttag tagaaatggg 17760

gtttcaccat gttggtcagg tgggtctcaa actattgacc tcaggtgatc cacctgcctc 17820

ggcctcccaa agtgctggga ttatagatgt gagccaccat gtccagccac ccatttaatt 17880

ttttgagcac aaaatatgta ctgagagcca cgcaagaaac aaattcgact tattccatgc 17940

tcttgagagg tcatgagggg aaacaaaatg atacataagt aactctgaga gaatatgctg 18000

cacatgctaa atcctgtgca agtaagatat aggattttag aggaagggag aatgacttct 18060

gattgagctg actagagaag gcttcagttt ttgagttagg tgttacgaga ttgggagact 18120

tttctcagca tatctaacag aagagggtat ccgaggtgag agtgtaaggc ctgggcaagg 18180

gttgggaggc agttctaata ctgaatgttc tgactgtggt ttactatgta tttcaggtta 18240

ttttgtttaa tctatccagt aatcctttca tgtaacaatt atgatgtgtg tgttttaggt 18300

ggggctacta aggctagtaa gtagtgaggc tggatttaaa cttaagtctc cagcttcgtg 18360

gcccaggttc tttatacttg actccacact gggcttatta agtgaatgac aaggagtttg 18420

atttgtcgag ggccgaggat atgtagtgaa aggtcataga ctataaggct ggcaaagaat 18480

ggtggagctg gggaacggaa gatctaaatg ccaaggtaag cttggacatt atttaataga 18540

ggcaatgggg agatactgag gatttctgaa gagggtttct aagtagtgct ttaaaaagat 18600

ctctgggttg cagaggcaag aaataggagg aacaccaatt acaaagcccc gaccgaaagc 18660

cgagtgagat ttgagggtca gccatggaaa tgggaatgaa gggacagatg tgagacattt 18720

caaaggatag actgcaggaa ttctggcaaa gaatgggttg gacagtgatg aagaacacag 18780

aaacagcatt gatgcctaca tctggagcct gtgtcttaat aatttttgat cagggagacc 18840

tacaatacat tttgtattgt gccccacaaa atcatgtcga cagccctaga tgatgtcatg 18900

gacaggtatg gggcatgagg agaaagagtt gccagggaaa ggctggctca cattttagac 18960

atgtggagcg ttaagggtca gcaagacatc aagggcgatg tcaggcaggt aactggaaat 19020

gcaaactggt agtctcagag tgacgctgag tccagtggtg gggctttgga aagaacatgt 19080

atctctaaga ggaacatgac ataatggaaa cccagaaatc ctgtcttgag cggactcagg 19140

ggctgcttct aggtaattta gttcatttct actgaaatca ttatatttaa aagtatggcc 19200

gcactgagat ggccactgta gctgctgcca cctcttagct ttggtcttaa aaaaaaaaaa 19260

aagtaataga acttccttaa aatgtctttc tagcctttgg atttctctaa ttcagatttg 19320

cgtcttccca agggtcaaaa ttatattttt actatccctg tcttaggtat ttccaaaact 19380

tcgtcttaag acttagtcat ttttttcctt caatttgaca tgactgctaa agacttttgg 19440

catgttcctc ctcctttcat ttgtgatgta attaagttgg tctgtaagtc ttatttttaa 19500

gatgttctag accaagagac tgtgagaata gcttacagtc atttcaacta atttatgtat 19560

tttaaattta aagtattgac agtggtgaaa acctgttcaa caagcagatg atgttatctt 19620

atatattcac agagtttagt aactgagcca actacttcat tcacagttca aaatgaaaac 19680

agctaattct tttaagtaag tatagattct actctttaaa agagttatct aggagagcta 19740

ctataatact attatagaat aagtgaaatt aagttatctg actgggactg gggtggaata 19800

ttgcacagtt tattgagaaa tatggactgt ttttgcacat tcataatgga catttgaggt 19860

ttgttagggg aggaggtgtc atctttatgg cactttctgg ctgggaaggg agtcagtcct 19920

aattgagata ataactagcc acctggccac acacaagtgt gttttgcctt agttacctgt 19980

cacacactga gcagtgagac tcaagagagt gtcaaagtac ttttcaatgc ataaagcact 20040

acagatctgt ccacactgtt gtgagtgagc aggttggcac ggtgcctgtg tgcgggcgtg 20100

tgtgttgact cacgtgctgt cctgtgatct ctcaggactc aggtcctgaa ttgctctgtt 20160

gtgactgaag cccagctgaa ggtgctggaa gtgcagtgac cctggaggaa gaaccagtaa 20220

cagcagaggg tggatgaaag ggaattgata gttgtgtaag aatagatatc tgccgttttt 20280

tgtaagccaa gacaccttta cccttccagt aattgtttca tcttttaata tcatttggct 20340

tcatttacag aatctgtatt aaagcaaaag tcagcatgta aggtggtatt ttgaccacat 20400

ttgtctgctg ttgtgccttc tgggtgaact gaagtactgg cttactgact agtaaatatg 20460

ttttctgaca attataggga agggaagaaa aggaaagtcc aattaaagca tttttctcct 20520

cagagtttga aaaatagaat tcatgcaatc ttttaaattc catgccaaca catcagacaa 20580

gaagagactt gatagtagta aaggttggga atcaaaagaa caatgtaagt ttttgatatt 20640

gacttcaaaa catggtgttc tataatttag tgttcatttg ttacgtgtat ggtattataa 20700

ttaattttgt atatgtggta gttatttttt gtactttgat taggaaacaa atattgagcc 20760

acttcaagag gcagactatt ttggaaaaaa aagtctggta aaagtaatgc ttaatctaat 20820

taatctgtct tcatcctctt aattcatcaa gatgactttg ggtgtctggt ggcaactgag 20880

aatgggttat ggaagaacgt caaggcaatg taatccctat tatttacggt tacttgagag 20940

gataaattaa tcagcggtca ctaatctttg gataatcact ctattgagct ggaactatcc 21000

ttagtatttt ggaaagcaag tcagtgagtt agaactgtca aaactgatca gcttttctaa 21060

gcttaatgat aagtgaatag aaactagttg ccttcaaccc tttcctccct gcattgcagc 21120

atgatcattc tgtaactctg gaaatggttt atggaacaac agtgaaaata cattgataca 21180

ctgtcttgtg gtagattttc agataggctt tagacaaagt tcagagcctt tcctctagct 21240

ggggattaac aaagctgcct tcatagttaa atgtttgcac cctgtgtatg cattttcagt 21300

tactagaatt aggtaagtta gtgttataaa attggtttga gtgtggattg tttaggaagt 21360

gagtcttttg gtggcagcaa ttctgttatg cattaaatag atacatattt tgaactagtc 21420

gacattgttt cagtctgtat ttattagatg ctggggtggg tatgggaata aagaaacgta 21480

tgaggggtct tggaaaagtt catggaaaaa atgtacacta tgaaaaaaaa ctgtgcatgg 21540

atttcaaaat atttttgaac caaaatcaac ttgtactaac ttgttacaac atgtctgaac 21600

ttgttacaac atgtctgaac agttcgagac attaagaaat gatatcgcac cagtttttaa 21660

aaagcgccta tcagggtaac atgaattctg ctaaaattga agcaagaaca aacatcaaat 21720

ttatggtggt gcttgggtag aaggatggtg aaatcattga tgctttacaa aaagtttatg 21780

gggacaatac tctaaaggaa ccagcagttt acaaatggct aacatccttt aagaagggac 21840

gagatgatgt tgaagaggaa gcccacagca gtagaccatc cgtgtcaatt ttcaaggaaa 21900

aaattaatct tgttcatgct gtaattgaag agggaacttt acatgaaatt ttaaacaagt 21960

gggattgaga tcctgtggca tatgtctgaa ggattgtaat aggagaggaa acatggcttt 22020

accagtatga tgctgaagac aaagcacaac caaagcaatg gctaccaaga ggtggaagtg 22080

atctagttaa agcaaaagca gactagtcaa gagcaaaggt catgataaga gacttttggg 22140

atgctcaagg tattttgctt gttgactttc tggggagcca aagaatgata atatttgctt 22200

attgtgtgtg ttttgagaaa attagccaaa gttttagcaa aacaaacaaa tacccaggga 22260

agctttacca gagagtcctt ctccaccagg acaatgttcc cggtcatcct ctcatcaaaa 22320

aagggcaatt ttgcaagagt tttgatggga aattattagg cattcactta cagtcttttt 22380

ttttttttta gttggagtct tactttgtct cccaggctgg agtgcagtgg tgcaatcttg 22440

gctcactgca acctccacct gccaggttca agcaattctc ctgcctcagc ctcccaagta 22500

gctgggatta caggcgtgat ccaccatgtc cagctatttt ttgtattttt agtagagatg 22560

gggtttcacc atgttgacca ggctggtctt gaattcctga cctcaggtga ttcacctgtc 22620

tcggccttcc aaagtgctgg gattacaggc atgagccact gtgcctggcc tatcttacag 22680

tcttgatttg gctttatctg acttcttttt ctttcctaat cttaaaaaat atttaaaggg 22740

cacctatttt tcttcagtta ataatgtaaa aaggactgca ttgacatgat taaattcctg 22800

ggaccctcaa ttctttagag atggactaat ggctggtatc aactcacaaa agtatcttga 22860

acttgatgga gcttatgttg agaaatgaag tgtatatttt cattatcttt taatttcatt 22920

ctttagtgaa ttttttgagg tccccttgta tacattttaa tcctaaggga ataaagaaag 22980

gaggaagtcc tagccctgtg ctgtctgcct aggtacagtg tctgaaacac agaccagtat 23040

tcaccctttg aaatttgagg tttccattca ggaggttctc aaagagaata aatgagattg 23100

ctatgcaggt ggaatcaaag agcacacggc ttatttatca taatcaaaat aatgccattt 23160

tcataacaaa cttcacctgc ttatgtacat tgtaaattgt tgccttgata agcttcccgg 23220

agataaagta attcagctaa gtattatttc caatcataat tttgtgtcat tatgagcaac 23280

acaatactat atatgggatt gattcactgc agaactggaa taaatataaa ttagatcttt 23340

agaaaagaaa cgtagattta aaaatcttat gttagaaggc tcaattaatt aaatgtaatt 23400

aattttttaa aatcagcttt attgagggat gacttagata ttatataatt cacaaatttt 23460

aagtgtacag tttgatagtt ctgacaatca aactgtatac aatcatgtaa ccaccatcac 23520

aatcataata tagtgtgtcc atcaccccag ggtgtaccct cgtgatcctt tttgcagtta 23580

gtctttttcc cttacattct ggctcctgaa aacttgatct gctttctgtc actatagctg 23640

tgccttttct aaaattttat atgaatggaa tcatacagtg tgttttcttt tgtatctgtt 23700

tttcactcag catgatgctt ttgagatttc tccttgttgt ggtatgtatt agtagttctt 23760

tcttttttat tactaagtag tattccattg tatgcctatg ccacattttt tttttttttt 23820

ttcgagacag agttttgctc tgacatccag gctggagtgc agtggtgtga tcatggctca 23880

ctgcagcctt gacttcccag actgaggtga tccacctgcc tcagccacct gagtaactgg 23940

gaccacaggt gtgtgctagt ctgtctaatt tttaaattgt ttgtagagat gggggtctct 24000

gtatattgcc caggctggtc tcaaactcct ggcctcaagc aatccttctg ccttggcccc 24060

tcaaagtgtt ggggttacaa gtgtgagcca tcacacctgg cctaccacaa ttttttatcg 24120

attcacatat tgatggatat tgggttgttt tcagttgttg cctattatga atagaactgc 24180

tatgaacatt tgtatgcaaa cctttgttgg gatgtatgtt tttatttctc ttttgtacat 24240

taaatttaaa tttaaatttt gttctgtatt atttgtattt ttaaatttct caagtgggta 24300

atactgtgta cttttttttt gaaattaaaa aaattgtggc tgaacaagga gataaaaaag 24360

taggagtgag aggactctgg agagttacag ggctttggtt tagaggattg gatgaatagt 24420

ggtgctgcca ataaagaaat ttaaatatgg ctgatatttc ctatatttaa gaaagaccaa 24480

agagggtcca ttgaaatgag tcagtgggaa atctctgatg acttcagcca gcacgctttc 24540

atcggcctgg atatatggga agtgaagtct gattatagtc tgtggagcag tgaatgggag 24600

gaagagatag gggtacaggc taagaaggga ggaagtcaag tcaaagggag aagtagggtg 24660

gtagctagag gaagattaga gtcaagcgag ggtaacaatt tttttttttt ttgaagatag 24720

gagtagcttg agaactaact taaagaagga gcctgtagag agggaggagg tgaagttact 24780

aaaggtctaa ttgatggggt aaggttcatg agcagatcag atctttacaa ggaaggttct 24840

tgctgggggg caagattcaa aacccctatt cagaaccagg agagaagaaa gtaagaatgg 24900 

gagcaaatgt aggtaggttt ggtgaggatc aggaaatgga ggggaagagg tcattaaatg 24960 

tggtcctggg gttgagcagc agattggaag agaatggcaa aagtttggga gctgattagt 25020 

gataaggaaa aggtttgaga atccgatgaa gattagaaac catgcatttg tagtgaaact 25080 

tgtttctaag attgtgccct tacccacctc cagctgtgct ctgataggtg aactatacaa 25140 

ttgatgtaag gctggcagat aatcaaagcc aagaaatttc atgttttcta tctattttca 25200 

cttccgtgcc atatagcttc tctcatatag tactcatatg ttttgagttt ttggtgaagc 25260 

tgattaaaca tttaacaact ttttttttta cattttttat tgtgacaaaa catatatagc 25320 

ataaaattac cattctggta aatttaccaa aacccattta tagcataaaa tttaccattt 25380 

tggtcaactt actgtaaatt actgccagtt ctgtacagcc atttgtttaa tgcagtctgc 25440 

tctaataccc tccaaggcaa aattgcaaat aaaaacctgt tttgatttct acttaatttt 25500 

ggtaaattta ccaaaagggc aaattttatg ctataaattt tttttgtttt gacatttaaa 25560 

atatgttttg taaaaaaaat tgacagataa aattgtatat ttaacatgca taatgtgatg 25620 

ttttgtatta tgtataacat gatgtatatg tcatacataa catatataca ttgtagaatt 25680 

gttaagtcta ggtaattaac aaatgcatta cctcacacag ttatcatttt tgtggtgaga 25740 

acattttaac atccactctc tttaaatttt tcaagaataa aattttatca tctggtcatg 25800 

gtggctcacg cctgtaatca caacactttg ggagggcaag gcgggaggat tgattgagac 25860 

tagaagttct agaccagtct agacaacata ctgatacccc gtctctacaa aaaataataa 25920 

aaaagaatgc aatattgtct aaatttcttg aatttatttt tcctacctaa ttgtcgttat 25980 

atatccttta accaacgtct gcccatttcc ctctcctctc taaccagtcc agcctctgag 26040 

accattgtac tttctatttc tatgagatca gccttttaag attctacatg tcagtgaggt 26100 

cgtgtaagta tttgtctttc tgtacctggc ttgttttacc attgtcacca tttataaagt 26160 

gtacaattta gtggcattaa gtacatacag aatgctgttc acccattact actgtcctac 26220 

aattttaatt tgaggcatga ctttgtcatc tctcttctat cctttgtacc attcttgatc 26280 

tataaactca acaaagtcct taactgagta gaatattttt gtatgtgtgt atagatgtgt 26340 

gtgtgtgtat ataggtatgt atagatatag atatagatat agatataaaa tttttttttg 26400 

agacagagtc ttgctctgtt gcccgggctg gactgagtgg catgatcatg gctcactata 26460 

gcctctacca tacaggctca agcaatcctc tcacctcagc ctcccaagta gctaggacta 26520 

catgcatgca ccaccatgcc tggctaatta aaaaaaaatt ttttttttct agagatgggg 26580 

tctctgtgtt gcctaggctg gcctcaaacc tctaggctca agcagtcctc ctgccttggc 26640 

ctcccaaggt gctgggatta taggaatgag ccaccgtgcc tggccagtat ttgtattttt 26700 

gatactgagc ttaattttgg caagtgctgt gcaaggcaca aattgtggtg ttagctttct 26760 

aactatttgg ttgtaattat ttgttaatat ctgtctttcc tagtcatgga aagtccatga 26820 

gtgcaggggc tgtgtctgtc ttgtttacaa ctatatttgc gctggccagc acagtgcttt 26880 

acacatagta gacattcagt gatgtttctg aacaaatgaa agtccttcgc tgcaagaaga 26940

atttatttca gtttaataag ttctttctga atgcttcata tgaagtaggc aaaatttgta 27000

ttcacatatt aatacagatt ataggtatgg cacattccaa tgtctattta attagtagaa 27060 

atcaaaaaga gtttttattt gcaattttgc cttggagggt attagagcag actgtgcact 27120 

aaacaaatag ctgtacagaa ttggcagcaa taatcccaaa gtttgaaggc tgtaggttaa 27180 

ttaggataac ttgacaggaa gtgagttaat caaatttgag agtttaatct tccaataatt 27240 

tatgttcaga catacttcaa gtatatcagc aggtaacagg aactttagtt gcagaatgcc 27300 

ccaaaacaca agaactccag tggattttct ggcttccagg aatgttttgg aggaagaaaa 27360 

accaataaaa tgatttgggg gtcattttgt tccattactc tatattaaat atactagaat 27420 

ttaaaatatt aaattttaaa agataaaaag atgcagttta cctattaaca aattaaataa 27480 

tttaggaatt ctacttagtt ctgtaatact ttaatatgag taaatatggg catttctgtg 27540 

ttagctagaa ttagatagag tattgccatt tttttcaact ggcttatggt taaatggaag 27600 

taaaggggca aactacacat ataagaatta gtagtacaat atttaataca cccctgtaag 27660 

agttatcaca gtgtgtcctc tgtgaaaagt aagggctcca tgtgtgcttg tgaaaaaggc 27720 

cactggaggc ccttttcaaa aattaaatct gcctccagca aggtgttttt cgatcacatg 27780 

gaaaggggaa gaagaagcta tcaggagctc tggggttttt tttgttgttg tttgtttgtt 27840 

ttgccacttt taactctcaa gctaaaactg gggtttcatt tgaggaacca gtaatagaaa 27900 

atttcttatg tacattcagc aaaatctagt actgagtggt tactttggct tttcattgtg 27960 

gggattgtgt gtgtgtgagt acatgcacgc acttgtgtgt ttaagcgtgt aaggcagaca 28020 

gacagtgggt acaggtcttt gaaatggact tcttggcaaa agtaatagag aaaaagagga 28080

atacaaataa gggaggaggg acagggaaga gcagagtcac aggaaacagt gaatgaggct 28140

gcagtctcag tcgccctttc tttgtccctc cagtgttgtt gcctgtctta tgatgatgct 28200 

ggttttcagc caaccttgag tgagtaaaag ccgggtctga ggtctcagtg cctgcgtggc 28260 

tgatatgagc agcttgcatt tctgactggg ccctggagca gcaacagcac agatttccag 28320

gaacagttcc tcttgtcatt tttattcctg agtcatcaaa tttagttatt cagacgtctg 28380 
ctgttcccag ctacatacag atcaaacaag cagggaattt ttttctttct cttctctccc 28440

tctttttttt gtatttccat cttgttttgt ataccttttc tttgtttaag tcaagcattt 28500 

gaacatcact agttaccatt tcctttagca agcataggac ttctgtctta cttaaatgtc 28560 

ttctaatgct gtgatgtgtc acagttagtt gagacgttaa agatgttcca tacatgtgac 28620

tacattggta aatctcaaaa acatcatatc gaatgaaaag acaaagttgt gaaatgtctg 28680

atgtgatgcc atttccttaa aaagtcatgt aaagcaatcc taccacatct catagaatct 28740

aaaaggctat actgatgcta agatgcacta ttattttctg tacactgaga aagggggaaa 28800 

attgccactt aaattgtgat gtaatgtctc attatggatt gtaagataca tctacatttt 28860 

agaaatggta taatgttaaa aatatgcatt ttaaattgaa ggtaaattta attaaattat 28920 

ttcaaaggaa ataaggtaaa tgtattttat tgaagcatag ttatgtaata aaaatagaaa 28980 

agcatgcata ggaatgctat ttagccaata caggatgtgg taatctctat aaagggaggg 29040

agggaaatgg aggggggagg ccagaggagg ggcctcatct ctgtagttta ttttttaaaa 29100 

ttataaagca aatattacca gagttaagat tttacaaaat tcattggtaa gcacctataa 29160 

ttttctgagt gctttcagta tttcataatg aaaagtatgt attttaaagg tacgttatct 29220 

atttattttt atttattttt tttgagacag tgtctcattc catcgcccag gctggagtgc 29280 

agtggtgtga tcttggctca ctgcaacctc cttaaagcta cattatttaa aagtcacata 29340

caaagcaagt tgcagaagcc tgtatgtagt ggattctatt ttttttaaat agtatttaat 29400 

tgtatgttct tctacacttt ttcctatgtc catcttacca tagctgtgcc ttttttggtg 29460 

gaagtgagga cagattgctt tccacatctc catttttgtg tctgaattaa aagatggaca 29520

agtatcatgt attatcttag tagtcatcaa acaaggaaaa aggtttcttt gtttgcttgt 29580

tttttttaga tgaagtctcc gcccaggctg gagcgcagtg gcacggtctt ggctcactgc 29640

aacctctgcc tcctgtgttc aagcagttct ctgcctcagc ctcctgagta gctgggatta 29700

taggcgcctg ccatcacgcc ggctaatttt tgtattttga gtagagacaa ggttttgcca 29760 

agttggccag gctggtcttg aactcctgac ctcaggtgat ccacctgcct tggcctccca 29820 

aattgttggg attacaggcg tgagccactg tgcccggcct ggaaaaagtt tttaatggta 29880 

aagatgtcat ggaatgaata ggattggctg gcattatttc ttgctgttaa taagcagtga 29940 

gaaatgtttc cattatatgt ttctttgaag ccagctttct ggttgctccc ttattctttc 30000 

tttctcaggc atgtggtatc tagaaagggt caggagtacc ttgataaaaa ttattgtaca 30060 

agttgagcaa aactcagtag tatcatgcct agagatctga taaagaggca cttttaaaat 30120

agagccttga agctaacaaa cttttttttt ttaacccctt tactgaaacc taaaaagagg 30180 

tctgcagttt ttccctcctg tcaccagatg aaagggctgt agtagtgttt gcttattcct 30240

caggcagtga ggaatcaaag gactgaaggg gtgtttgttt tatcatacta cttgtaggga 30300 

ccccttacct cctataccca tgaaaaagga atatttctta tatcccataa tattctttta 30360

gtatcacact taggttttaa tgtccttact gttagagtaa aataatttgg gcaagaccaa 30420

ttttttaaat ggcaaatata gtcacatcac ttgattcaga atctcactcc ctagcatctc 30480 

gcgtaatgac tcataagaaa gaaaaagcta aatgcatgaa gaggttcact ataccattac 30540 

tataaaaaag taaaaatttg tttttgttat tttttttttg ttttttgaga tgtagtctct 30600 

ctctgttgcc caggctggag tgcagtggcg caatctcggc tcactgaaac ctccgccacc 30660 

tgggttcaag caattctcat gcctcagcct ccggagtagc tgggattaca ggcatgcacc 30720 

accatgcctg gctaattttt gtattattag tagagacggg gtttcaccat gttggccagg 30780 

ctggtctcga actcctgacc tcaggtgatc cacccacctc agcctcccaa agtgctggga 30840

ttacaggcat gagacactgc acctgggcaa aaaaagtaaa aatttgaagt caacttaaat 30900

gcccccaaat acaggaatgg ttaaatgctg atgacctact ttatgtacta tgatgcaaat 30960

gcaaatgata atggttactg tgcagcaaca tggaagaact tggtgaaagc aggatgtacc 31020

acatgactgt aactccaaaa tgtgcttgca tgtgaggaga gatcaaggga atatagaaaa 31080 

atgaagacat agttgtgtta gggtggtaga attttgagtg gatttctccc cccactattg 31140

gtaaaaattt ttgtacttaa ttcgttgtgg gcagccagat cttttaaagg taaatttgaa 31200 

tttctcttta agaaaatggc agacagaagg atgggggata ctagaaaact aaaagtagtg 31260 

ccccctttga agataaaact aaaacatttt aagcctggaa ttgctttagc agtacatgta 31320 

ttgattattt aattttgtcc tttagaagaa agttggccca acacaattac atggaagttg 31380 

ggttattgaa gaggattgat aaaagaagtg ggaaggtcag gccaggtgtg gtgggtcatg 31440 

cctgtaatcc cagcgctttg ggaggccgag gtgggtggat aacgaggtca ggagatcgag 31500 

accatcctgg ctaacacgtt gaaaccccgt ctctactaga aatacaaaaa aaattagccc 31560 

agcatggtgt tgggtgcctg tagtcccagc tacttgggag gctgaggcag gagaatggcg 31620

tgaacccggg aggcggaggt tgcagtgagc ccagatcaag ccactgcact ccatcctggg 31680 

cgacagagcg agactccgtc tcaaaaaaaa aaaaaaaaaa aaaaaaagtg ggagggtcaa 31740

agccaatgtg cacgtttatt cttttgtacc accaaataaa ccgaattttt ggaccaactt 31800

tactttagtc aatatttggt tatttgcttt gaagctattt gtttgcaata atgcaattaa 31860 

catcctcaac agagccaaag tctgaccctg aaatggcagc atcttaacta attttatcat 31920

ttgaatagaa attgaaccct ttgagggcat gaagtactat ttcttttctc tatacttctt 31980 
gtttttggtt tttgcttcac caccgattta tgactaccaa aaccccacag atcctgtaat 32040

taactcacag ctgccatctt gccaccgtag atagaattca ggagctggtg atgaatgggg 32100 

agagaataga aaaactaata gactgtaaga attttagacc tctgtggcct tgctgaacaa 32160 

ttaatccagc cccttcactt tacaggtaag taaacaagtc ccagggaccc agcagtgaac 32220 

ctgtgtctta ggacttcaaa tctagtgcac tttctgttat acttcaagga gagaactgga 32280 

gggagagcga agccaaaata ctgacatttt agaggctgct ttttaagaaa ggataggaca 32340 

ttggacttgg tgtcattatc atgttaagtt atagaatttt taaaaacact accactccaa 32400 

aacaaaaata ggtaaaagat acgaacaggc agttcaccaa agaagaaata cagtaggcat 32460 

gaaagaatgc acaacataat ttaagaaatg taaataacaa atatctattg ctatctcaag 32520 

agaaaacatt tactatgagc tttattattc aaaccattgg cctcataggg cttcttgttg 32580 

acaatctacc ctaattttag gaaatgtaat tataacataa aaaaatgaat gcaaatgata 32640 

aaagacgaag tataacccgc tcaataaatt ttctgaaagt tatagtagtt ttaaacctct 32700 

ttttattcat tccctccagt tctgtctagc acctgtaaag atgaattatt ggctgggtgt 32760 

ggtggctcac gcctgtaatc ccagaacttt gggaggccga ggcgggcgga tcacgaggtg 32820 

aagagatcaa gaccatcctg gccaacatgg tgaaaccctg tctctactaa aaaatccaaa 32880

aattagctgg gcgtgctggt gcatgcctgt agtcccagct acttgggagg ctgaggcagg 32940 

agaattcctt gaaactggga ggtggaggtt gcagtgagcc aagatcgcac cactgcaccc 33000 

cagcctggca acagagtgag cctccgtctc aaaaaaaaaa aaaaaaaaaa aggtgaatta 33060 

tctcaggttt tatatggcat tgatggacca ctattgaagg aagctgtgaa ggactcaaag 33120 

cagttgagaa taggtatcgc cacctaattt tacttgcctt tcctgccaca cacaaccaga 33180 

gaagctggtt catctcttac tgaggaaata ttttcatgct tattcagaag attttgcagg 33240 

atcctcagga agacaaagcc aaagaacaag aacagtcaaa gtaggaacag gaccaaaagt 33300 

ctgcacagtc actggaaggt gtcatgctga agaaggcagg gatgggactt tgaaatgagg 33360 

ccaagtgcat ttcagtaact gagtgggtta tctgttgttg gcaagcaatg tgccatacct 33420 

ttgaagggct aagctagccc aggagttctg ataagagctt tgaacttaat atcgctgaca 33480 

caaaaatagg tctcctcaga tcctatttgg aggtaaaccg ggtaaaagtg ataaatagta 33540 

ttgcttttta aagttcatta catattttag aggtgagtca gtcagaccat atagagatgc 33600 

ttcattttaa tcctctgtag caagaattgg caaaaatatt ctctgtaaag agccatttag 33660 

taaatatttt agagtaggct gcatagtctc ttttgcagct actcagcttc atacaacaag 33720 

taaacaaatg agtgtgactg tattttagta aaactttgtt agacgctaac attggaatct 33780 

tacacagttt ttatatgttt tgacatattc ttttgatttt ttccaaccat ttaaaaatgt 33840 

aaaaagcatt tgtagcttgt gaggtgtaca aaaatgagca gtggaccatg ttttgcctgc 33900 

agggcacatt gtgctgatct ttactctata gcatcactgc cagtagaaat ataatgctag 33960 

ccatatgtta tggactgaat gtttgtaacc cccaaaatcc atattttgaa gctcaaaccc 34020 

ctggtgtggc aatatttgga gatgggacct tgacagatgt acttaaggtt aaatgaggtc 34080 

acaagggtgg ggtcctggtc agataggatt agtgtcctta caagaagaaa caccagatac 34140 

actcattgag aaaatgcaga acattgtaac cttactggtt tagaaaccat aatactggca 34200 

ggcttgctgg tcttcatttt tattattttg aatgtcttct ttcaagtatt ttatgtatcc 34260 

agacccaatt tgtatgataa gggaatttct ttcaaactat taggtgaagt ctttaattat 34320 

gaagttggct taagccatta agtagtatta atggggacat ccatcttaga aattaactgg 34380 

aaatgtagac tagaaatata aaaaagtgtg gtcatgacta atgtctgtat ccatttccca 34440 

aaaaagagat ttgcgtatgt cctcacattg cagaaccaca ctgacactca gggaaacatg 34500 

ctgacgtcat tcttccaaca tccttagttt ggatttcatg aaacattttt ttccatttct 34560 

cctttttctg tgtttagtaa tcctttcttg gattcttgaa atcagaggca cttattggag 34620 

ttgataaatg gcagttcttc atgtcactgt ttagaaattt aacttagctt atggactagt 34680 

tcaggcttag ataactctgg aagtctttgc actttcaaat atactttttt atagttgagt 34740 

ttctacacaa tataacattt tataacttaa atggaagtaa agtgaaatgc caaaatgcca 34800 

gttgattctg ttaatgtgtg catccctcca ttccagaatt ccaaaaatat cagacataat 34860 

gacttttatt tgtattttat attattagat ttatcagata taaaattgtt cttttaaact 34920

gctatttcta aacactcttt gtttctttag ttatcttgtg aactggtaat agttttcaga 34980

ctggtgcctg tccatgaacc actttcagca ttggtccaga tgactttcac ttataccatg 35040 

agccagcaga tactgtagat tatttggctg tttttaaaga gaacatcata tcatagtgtt 35100 

ggttattaat atttgagcaa gatattaggg ttgtctcgat gtcccaatca ttgtggagtg 35160 

cttagtgtgt gtttgttttt taagaattga cttggaaaca tttatcttaa ggtgttaggg 35220

ttatttttat tctttagtac taacggtata aactgaattc cactgaaata gagcttattt 35280 

tgatgtataa tttttgaagt tattttttat aagtctgtgg gtagatagca gccaagtagg 35340 

cactttattt ccctaataga aaaaaatata tatttttgtc aaatatttct gttttattca 35400 

ttcattcaaa gtatatttgg aatgtttatt tccaggaaat tttggaatat acaatacaac 35460 

cagcttctta taactccact ttaagtgagc cataggtcaa ataatgacca gcaaaatgta 35520 

atgacacgtg tgcctcttac tccctgttgg aggaattgag gcactctggt aaccctgtag 35580 
gcctggatta gtccagttca ttggcagcag cattatccag attttattgt ggccggcaac 35640

ggtggctcac accggtaatc ccggcacttt ggggggctga gttgggcctg ttgcttgagc 35700 

ccaggagttc aagaccagcc tgggcaacat aggaaaccct gtctctacaa aatatataaa 35760 

aattagctgg gcgtggtggc gtgtgcctgt agtaccagct acttcggagg ctgaggcagg 35820 

aggatcacct gagcccagaa agttgaggct gtggtcagct ctgattatgc cactgcaccc 35880 

cagcttgggt gatacagtga gaccctgtct taaacaaaca aaagagattg tattgtgttt 35940 

tgaaaaacat agtttttaga gaaattctgt gttattcttc tatttccatt acagcttggt 36000 

ttaaagaaac tatggaaagc tacagactaa cctccttttc ctttggtttt ttttccatgt 36060 

tatagcagag aaccagccag ggccagattc ctgggcttgt gtcctctgca gttgcacagg 36120 

gctctgctca gaagggtccc atacttggct taatgctctg ctgtcagcat cttgaaaatt 36180 

tttaataatt ttatatttca tcttgttttg taagttataa gcccagtggg ccagtggagg 36240 

ctgttctagg ggtctgggtt tggctcatga gaggtacatc cttcccacct ccccaggatg 36300 

ggctctcagc agctggcttt ctgtaccctg gcaccccagg cccttcttgg cccctacccc 36360 

gtgttcactg ttgccctcac cttgggtgat gactgggtca catgggagag gaggaaaccc 36420 

atgttctgct gtcaccctcc acctcttgct gggtcctggg tacaagctca gggagggttg 36480 

ggatcaggcg tctgtgactc acagtgttgt ggggagtgat ggtggtcatt ttatcccagg 36540 

ctggcagcgt gctggcatat tgatgagcag ctgctggcat gatgatttgt cctgaccctg 36600 

gaccggactc tgtatcttgg ctttagtcta gtccacttct cagcattcct tggaagccca 36660 

cctaaactgc agagacaaag aagaatatgc tttgggattt taaagatgga accccatatg 36720 

ctatcaattg gaaaggagtt cagattctgg ctgggctgaa tcactgattt tccaggcaag 36780 

tggaacttta atgacccact agtgcccttt gttcactgtg ggctggctgg ggaggaaccc 36840 

ttgcctttaa agaaccctgt gctctttaaa gtttccaggg cacaattgag atccttgagg 36900 

tacagttcat ccatgcagac ctagcacctg ctttaggtaa gggctcttgt tacttcttgt 36960 

tcttatcggg ttcttatgtt cagctcagct tcctgcttcc tcagagagaa gtggtaggtt 37020 

ctgttttagg cttttctttt ttggtggtgg tgggaggtgt caaaccgcat tattgagggg 37080 

agtgggggtg gtaaagcagt ggtgaggacg atgagaagcg acctccttag ccaggccagg 37140 

agctcacctc ccagaccctg cccttcaggc actgaaaact gttttaggct tttctgcatg 37200 

gttggagttt actcttagtt gctgacagtc actcatcccc tggagagggg tactgagcat 37260 

aatcactgtt ggtattttat ttactaccag ttcttcatca ttaccgagac tcattccatt 37320 

tggaaaataa acattgcctc atctgttcat ggtatatttt tacacttgcc tttcactcac 37380 

atgttgcata ttaactgaaa agatcagcat gatcttggag cctaatgtga tctggatctt 37440 

attaccatgt atgtttttat cgtcaggtgc tgctttggac tctggcccat tcctcccacc 37500 

tgtcttcatc ctgtcggggt tgatctctgc ctggttccat agtcacttgg cttttggagt 37560 

tttatttctt ctctaaccca ctcttccctt ccctgtcaaa tataattctc tcatgagcaa 37620 

ctgtgcctga taatcccagt gtataggcca tagctcaggt tgttttaatg cttgcaagga 37680 

agtctcttca gattactaca gggtaaaaga atccattccc tttcaacctc tcattactta 37740 

ctgaatcaat tatttattta cttgtgtaat tcacatccat gctcaggtat gtgtttagtt 37800 

tacctctgag actgtggctc cctctagaca agggtctttg tctggtatta gtccacttcc 37860 

tcagtgtcta atttcatgag gacagaatca gggcaagacc tactggctaa gacttgctgc 37920 

tgcctcagtt atttggttta ctgtttaagc ccttgagcca ggaggtgaca gtatcttttg 37980 

tgttccaagt cagataacca gtaacagtaa tattctgatg taggtctaag gggcaatagg 38040 

agcctgaatc tgagcccctt ggcagggatg gtttccctgg gttactggat ttgagtcttt 38100 

gtcttccgta atagtaactt ctgtgaccct tggctcaagg agtcctttct aacctaaatg 38160 

ccctcctgaa gaaagtgctg atattcatca gaaaaaaaag tagggtttgt gttatgttac 38220 

catctgggag acattggctt taatcctctc ccctcttctt ttccttttaa aattttcaag 38280 

aatccccaga attgagggtc aatcctggat cagaaggggc agaaaagtca gccagtcaga 38340

ggtgagaaag gggtcttgtg ttccagttgc tctactttag agggcttgag ggccgcatta 38400

ggagcaaacc aaatactctg aggtcctagg agaagattgc tctaaatcca ggatgatttc 38460 

atcatatgac ttgagatgga ggggttcttg attagtaatg ggctttttga ttggcaggaa 38520 

gtaggtaaac tgcaggcaat aggattggcc atagaaatat agggaactct atgctttcaa 38580 

ctcttgtgaa gttaggacca aaccaaatta tttggggttc attggtggct aagaaattaa 38640 

cttttatact tatagttaaa aaaaaaaaac ttaaagaact tttattaatt cttccccaat 38700 

agagtgattt tttttttttt ttaagagacg gagtctggct ctgttgctca ggctggagtg 38760

cagtggtgca atcatggcca cggctcactg cagccttgac ctcccaggct caagtgaccc 38820

tcccacctca gcctcctgaa gtagctagga ctacaggcgt gcaccaccac acccagctaa 38880 

tttttgtatt ttttgcagag atgagatttt gtcatgttgc ccaggatagt ctccaactcc 38940 

tgggctcggg ccatctgccc gccttggcct tccaaagtgc tgggattaca ggcatgagac 39000

accatgccca gccagaatga ttttcctttg ttgaattctc aaaattttat tctggcttag 39060 

gtaattgagt aggaatggca gagagtaata attttcagtg taatctttag cattacataa 39120 

acctctacaa acatgaagta aaaagttctt tacagacttt ctataggaaa aataaatgtt 39180 
tatagatctg cttatcaact gtttgttctt cctacctctg cattttcatc tattgataac 39240

tgtcagagtt caaggtcagt taattgtaca ttttctggga gtttttctca ctgttaatta 39300

gtaagacttt gaatcattaa gtgctgatga ctacatggca caattcatta aagacagtca 39360 

tactttaacc ttagtacagc tataatggga cggtgaaata caactaggaa agtatcttat 39420

taaagtattt tacattttac taataaaatt tatttaaacc ctgaattagt gtttattttt 39480 

gtttccgaat tctcagtaac atctcagtag atccgtgaat cctgccaaaa gacgtatttt 39540 

aatccaaaga tctttctgca tctataattt accacactaa agctcacatt atcattaaaa 39600

gcagcattat cattaaattg tagacattta ttagttcatc tccaggccta atgggagtgg 39660 

tttggagtaa acctttgtag aaacaataat atgttttgat aaatgcatat gggcgtggag 39720 

tggttatcac ttacctcctc atgaaatttc tggaaaggtg atccggaaaa ccaggacaca 39780 

tttatggtaa gatataacac cttaccataa cgattaaggc tcattgaaaa actctgactt 39840 

aatgagttta ccaaaaaact gttctatcca catctcatca aaccgcccct gaaattccct 39900 

tgcctctgtt aaatttttct agacagtctg aatagaggca tgtaattttt ttggattttt 39960 

ctgtggttaa ataaatatcc tttacaactc tctttattct tgaatatcca taagagttta 40020

tatttatact gtatgtttgt tattaggatt cctttcaatt gctatataaa aaatgtaaag 40080

tctgtttact gccttaaacc ttctggtgta tttttatata aagtaacacc cttaattcta 40140 

acttggccaa caggtaggat ggtattatta ttatcttcat tgtacagata aggaaactga 40200 

ggctcagatt gactagatca aacaggagtt ttctggaaaa cctaggacac aagcctaaat 40260 

ctttgaactc aaatactgct ctacactgaa ttacagttat atactgattt ctgttgtaaa 40320 

ttcttagaga agacagacat agaaattagt aacttgagtc agtagcggct ttgttcaaac 40380 

acaggcacat gcatatttta tggtatatgt ttatatctgt gtaatactca tcataaatgt 40440 

cagatttata atcgatagtg cccatttcta aatttatagt tgaaacctct gataggaatt 40500 

aggaattatc ttaaagtcct agaataataa atattaagat tttgaagact gcttaaaaca 40560

gtgttccagg ctggattttt tccttagttt ttttttcctt agttttttac cttagttttt 40620 

tccatgtaat gaatgaggag ccattggggt tggtgtttgg ttgtaaaggg caggttgact 40680 

tcaccaggag gtgtttcttg gtatttatgg atcctctttt tcacactaat cctttgatta 40740 

gcttcttctt atagttcatg cttgtacagt tgcagctgaa tggtaagtag gtagaaatat 40800

gcattgacta atgttgaact atatctagga gcgtcatatt catgctacta ctgagcactg 40860

tgacctaggt gaagtgtgag attaaagagc tttattgctt accacgtttc tttttgatct 40920

agttagagtc atgtgaggca gtccatgact tttaggatgt tataatataa tccacatagt 40980

actttatact tttcaatttg tttttatata tttttcctta ctataagcct gtcaagtcta 41040 

gtgagatgac atggttatta atatattaca aatgaagaac tctgacttgg aggtagtata 41100 

gcactgtggt taataataga agttcttata tttttatatg aattatatat tatgaaccag 41160 

accagaatac tagctctacc aattacctag ctgagtgatg gtgcagcaac ttaatctctc 41220 

aaagccctgg ttttctcatt gtaaaatgga ggaccttata ggattgttgg gtagcttaaa 41280

tgagataata ggtagaatta aaagtaacgg caaaaaccgc aattactttt gcaccaacct 41340

gatatttaga acagtggagg catatagtag gcattcaata aatattggta gtattatttt 41400 

cagaggttat tactcattta actcaacaaa aattgagagc ccctttctat acgctagtca 41460 

ttgtgctagg catcaggagc acagggcaaa cctggtaggg tgcttactct gatggaattt 41520

acattttagt ggtagaaatg gtaaatgagt caacaagaat tttatggagt gacaaggact 41580 

ttgaggaaag taaaacggga caatgcttga gaagggtgct agttgggaag tcctatcaca 41640 

cgaagtggta tttgatctga gacctgagtg accatagcca catgaatacc tgggggaatc 41700 

acttcccagg aggaaacaga agcacagaag ccgaaaggca ggatcacagg cggcacattt 41760 

tattggtgca gtgtggaggt agagggactg gattgtgtag ggtttgtagg ccagggtaaa 41820 

gagcttggat ttcttctgag tgtaggattt tgagcagggg agtgatgtgt cttattttgg 41880 

gcgggcgcta ggccaggtac tgtgaaggaa ataatggtat agaagaaata tgtctttctt 41940

gctagaatct tatgtgacat acaactagtg gtggctgatc agcaattaga actgcagtgt 42000 

tcttgctggt gtactcctaa tctgatcatt ggatgatatt tatcacgtat ttttgcagca 42060 

gtcatctcta aagtggcatt atggttctag ttatctatca ttggatctgt ttgctctaac 42120 

agtccgaagg ggcatgaagg gtggaaacag agtagaggga aatgggtgaa aaggctattt 42180 

ctaaacacta gacacaaaat agggattagg aagtagtggg ggttaggaga aaaagcaaaa 42240 

tcctgttgtg catggagagg gacattgttg aaactgtggc ttcttgagat tacatgagtg 42300

aacctgcgtg gactgggtcg tcttccctat tcaccaagct gaatccaggc ccaatccaag 42360 

attactgctg cctttgctct gaggctttag actgttagta gcttttctcg tctctactga 42420 

ggccaaaagg ggcagtgata cccttgaatt ttcttcttaa aacagggttt catttccttg 42480 

gaagtttgtt tctttgaatc tttctgtcag tttaactgtt atcatcaatt ggttagcatt 42540 

ctaataataa ttataattat agtaaacatt tattgagtgc ttacgaagag ccagttccaa 42600 

gcttttttat ctccattatt ctgctacttt ccttctcatt ttacagatga ggaaaatgag 42660 

gcacagagtg gttaattaat ctgtttgagg tcccgtagca ggtcagtgat gccagggttc 42720 

aaacctacac ttaactctac actagagact gttttcttaa ttatttcttc acaatcatat 42780 
gtttaatgat tacttattga ttatttagtg gtctgataag aagagggagc ggtgctcttc 42840

tgttggagaa gaaaggctgg ctgatcaaga cacactggtt ggtttgaaga aaaaatatag 42900 

atgttaattc cataacacca cactctaaac atttctactg gacgagttcc acctgtgtgc 42960 

cactcgaagt cggatgcagt aaggaaggct ttttattgag gagagaacga atacctctgt 43020 

attcaaaaga gagtgtgttg ttccttatag aagatggaag ggggcttgcc agtgacagat 43080 

tatgatgatt acctccttag tggttttttt ttattgcaca gactataata ataattataa 43140 

aaatttgtaa atcaagaaaa atctttaccc caacaaattg tttttatatt tttcatattc 43200 

atattttttc atattcattt tattttttca tattcaaaaa taaaacaatt ttcatattca 43260 

tttcaattgt ttattttttc atattcaaaa ataaaacaat tttcatattc atttcaattg 43320 

ttttattttt tcatattttt catatttcat attttttcat ttttatacac agttatagag 43380

tatctactta tttttcattt aacattatgc tataacaggt tttaaagtta ggcagtaagt 43440 

ctttccagtg aaaaggaaaa gactgaaaca tgaaaaggtt acgaggcatt actaattgat 43500 

tggaatgatg cctaccaggc agcccatgtc agttttggag ctggccttgg aaggaacagc 43560 

tgtgtatttg gaccttggag cagtctgtca ggcttccaag aaagtgggca aagaaggaag 43620 

gggggttggg ggcacaagtg gaaaagtaca gtggtttgga actgatgtga aactatcaag 43680

ctggttgagt tcagtgttag aaaataggaa aaacaaaaca atttctacct gatatggtac 43740 

tttaattgag gttagcactt cagtgaatag gcaatgcatt ataatttgtc agaattatca 43800 

atattttgtt cttgtcagtg gctttatatt taatagttac atgttataaa gccagtttta 43860 

tcagtgaaag aacaaagttc ttgacaaatg ctattttagt gataaagctg ttattctgta 43920

tttaaattca gtttaactca aagtggtttt taagttttac atttgtataa ctgtagacta 43980 

gccatatggc attcaaaggc ctccaagata taacttgaga cactcttagg atggaatccc 44040

atattcagaa tctttactga cagctcccca tatccaaatt gccttttctc tcttgggcct 44100 

ctgggtgtga caagggacaa cttgagtgaa aggccacagg acaaagtaga gatgctgtta 44160 

aatctcaatg cagaaggcct gagacaactg atagagacaa ttgtttctct tttctccctc 44220 

ataaaataag taaaattcaa agtgattttt ttttaatttt ttgctttgac acattgtgtt 44280 

tagtctgatt gaggtcatct tacacaacag aatccaaaat cctgagaaaa taatttatac 44340 

aactagaatt ccaaaattag gttttgcatt aagttcaagc tatcatttct ttaagaaaac 44400 

ctacactgga gttaccaact ggaatatttg gttatgagac acagtcatca tgtcagagta 44460 

tgtagtttgc ttctaacact tatttgtgtg cttttaagac ctgctccctg ccacacttta 44520 

tcataacaaa aaacaagaga cacatgttag tactgtagca tgtatagtca tggggaatgt 44580 

gttctaaacg aaatcccatc acagtggagt caacacaata tttctagggg agaaggtctt 44640 

cctgttaaac tcatgtgaag gcattcttct ctactatggg aacttgttta tgtgccctct 44700 

agaagacagc tgagatggtc ttcagtgaat ctgttcactg acatgttgct agtttcttta 44760 

tgtttacatg taaacagtgc ttagaagcat ctttccccaa gctgttcttt ttttgatgga 44820 

ctcccccttt ttgggggagt atcagttgaa tgagcactgt tgtactctca gcactagaca 44880 

ttctgttatg tgtgtgtgga gcttccttgt ggcttggtag cattacctct gcattacatg 44940 

ctgcagcctc accaggcaga agggccattc acttctgctg ctacttacca gcacctctgt 45000 

ttcctcgagt gtgttaccct cccaccacct gggccttgga cctgggtccg ttgggctatt 45060 

atccatgctc ccacctgccc atctctgctc tgatataatc tactaggata ataaagtagc 45120 

ttttccctac cataatgttt tttaaataga catacagcaa tgtataatgg ttaagagcat 45180 

gatccaaact gtgtctgatt ttgaatcctg gctgagttac tagctatgta atcttgagaa 45240

aactactcaa tctctctgtg ccttaatttt cttaggtata aagcagatac taattatgcc 45300 

atcatagggt aggtatgagg attaaatgag tgagtatttg taaaacactt aaaacagtga 45360 

ctgacttggg ctaccccttt tgggtcccct ccctttgtat gggagctctg ttttcactct 45420

attaaatctt gcaacttcac actcttccag tctgtgtttg ttatggctca agctgagctt 45480 

tcgctcgctg tccaccactg ctgtttgctg ccatcgcaga cccgccgctg acttccaccc 45540 

ctctggatcc ggcagggtgt ccactgcacc tctggtccag cgaggtggtg cccattgccg 45600 

ctcccaatcg ggctaggggc ttgccattgt tcctgcacgg gctaagtgcc ctgggttcat 45660 

gctaattgag ctgaatagag ctgtaacact cactgtatgg cccaaggttc cattccttgg 45720 

aatctgtgag gccaagaacc ccaggtcaga gaagaagagg cttgccgcca tcttggaagc 45780 

agcccgccac catcttggga gctctaagaa caaggacccc ccggtaacat tttggccacc 45840 

atgaagggac ttccaaagcg gtgagtaata tgggaccact tttgcttgct gttctgccct 45900 

attcttcatt ggaattggag gaaaataccg ggcacctgtc agctggttaa aaacaattag 45960 

catggccacc agacttaaga ctcaggtgtg aggctgtctg gggaagggct gtctaacagc 46020 

ccccaaccct tctgggttgg gagcgttggg ctgtctggaa ccagcttcca ctttcagttt 46080 

tcctggggaa gctgagggcc gactagaggc agaaagctgt tgtcctgaac tcctggtgtt 46140

agctggttga gatcatggcg cagccagaag tctctactca acagtcagcc atgcgtgcac 46200 

ccctaccttt ccttctgacc tatacctctt ggtcctgacc acaacttgct tgaaagtgta 46260 

gccccaaaat tctccttacc tctgaatcta cttcctccaa tccctgcctc ctaggtacta 46320

atggttcaga ccttcatttc ctctagcaag ctgtatctcc aaagggatct aaggaagctc 46380 
tatgctgtgt ccttaagccc ctaggctctg aacccagata gtcttgtccc tggtgtccct 46440

cccaatttag gcatacagct ctcaacatgg gcagttatgt aggacctgtt ccccaccatc 46500 

cttgccaggg tcccaggttt gtaaagggct aggagaagag agagagagag agagacagag 46560 

acagagggga gagaaagaga gagagacgaa gagggagtca aagagaaaaa gaaagagaaa 46620 

gatagaagta gtaaagaaaa aatagtgtgc cctattcctt taaaagccag agtaaattta 46680 

aaacctataa ttgataattg aaggtcttct cctgaccctg taacactcca ataccacttt 46740 

attgtcagtg taaacaaggg ggtagcccaa aaacactgag accactgaca acctatcaaa 46800 

atcctatcaa aaatccgtaa cccagtaaca cgtggatggg ccaaaggcat tcagtcggta 46860 

gcggcaactg ctttgctaaa agtagaaaaa taactttaga ggaaacctca ttgtgagtac 46920 

acctcaccag ttcaaaacta tcctaagtca aaaaagcaaa aaggtaactt actaactcga 46980 

aaatcttaaa atatggggct cttctgttag aataaaggta acttattaac cactgaaaat 47040

tcccttaacc cagcagattt cctaacaggg gatttaaatc ttaattacca tacaaaggtc 47100 

cgacagatct aggaggaact cccttcagga caggatgata gatggttcct cccaggtaat 47160 

tgagggaaaa aaccacaatg ggtatttagt aattgatagg gaaactcttg tagaagcaga 47220 

gttaggaaaa ttgcctaata attggtctgc tcaaacatgc gagctatttg cactcagcca 47280 

agccttaaag tgtttacaga atcaaaaaaa ctcaatctca atcctgactc aaaaggttac 47340 

ctacaccctc tctgaaatga atttgcataa gaactgttgt ttatgggaat gcatcttgat 47400 

ggggcaactg ggttgttatg aaatactcag gaacccagcc cagctccagg actcacccct 47460 

gagcaaaaaa gcaatgttgg gcactctggt aaaggaccac tagaatccag cagcctggac 47520 

ccctttcttt gtggtcaaga aaggcaggaa aaggggtgca ggactgctac atcagtgagt 47580 

gcaactaatc tgataagcag aggtccatgg gtggttacac accctggaaa ggaataaaca 47640 

ttaggaccat agagaacgct ctaggactaa tgctcattgg aaaatgacta ggggtgctgg 47700 

catccctatg tccttttttc agataggaaa tgttcccccc aaggcaaaaa tgcccgtaag 47760 

atatattctg gagaattggg accaatttga ctctcagatg ctaagaaaga aatgacttac 47820

attcctctgc agtaccacct ggccatgatg tcctcttcaa gggggagaaa cctggcctcc 47880 

tgagggaagt ataaattata acaccatctt acaactagac ctcttttgta gaaaagaagg 47940

caaatggagt gaagtgccat atgtacaaac tttcttttca ttaagagaca acttgcaatt 48000 

atgtaaaaag tatgatttat gccctacagg aagccctcag agtctacctc cctaacctga 48060

tgtccccctg actccttccc caactaataa ggacccccct ttcaacccaa acagtccaaa 48120

aggacataga caaaggagta aacaatgaac caaagagtgc caatattccc tggttatgca 48180

ccctccaagc ggtgggagaa gaattcggcc cagccagagt gcatgtacct ttttctctct 48240

cacacttgaa gcaaattaaa atagacctag gtaaattctc agataaccct gatggctata 48300

ttgatgtttt acaaggatta ggacaatcct ttgatctgac atggagagat ataatattac 48360

tgctaaatca gacgctaacc tcaaatgaga gaagtgctgc cataactgga gcccgagagt 48420 

ttggcaatct ctggtatctc agtcaggtca atgataggat gacaacggag gaaagagaac 48480

gattccccac agggcagcag gcagttccca gtgtagctcc tcattgggac acagaatcag 48540

aagatggaga ttggtgccgc agacatttgc taacttgcgt gctagaagga ctaaggaaaa 48600 

ctaggaagaa gcctatgaat tattcaatga tgtccactat aacacaggga aagaaagaaa 48660 

atcctaccac ctttctggag agactaaggg aggcattgac aaagcatatc tctctgtcac 48720 

ctgactctat tgaaggccaa ctaatcttaa aggaaaagtt tatcactcag tcagctgcag 48780 

atattagaaa aaaacttcca aagtccgcct taggcccgga gcaaaagtta gaaaccctac 48840 

tgaacttggc aacctcggtt ttttataata gagatcagga ggagcaggca gaatgggaca 48900 

aatgggataa aaaaaaaaag gccactgctt tagtcatggc cctcaggcaa gcggactttg 48960 

gaggctctgg aaaagggaaa agctgggcaa atagaatgcc taatagggct tgcttccagt 49020 

gcggtctcaa ggacacttta aaaaagatta tccaaataga aataagccac tcccttgtcc 49080 

atgcccctta tatcaaggga atcactgtaa ggcccactgc cccaggggac gtaggtcctc 49140 

tgagtcagaa gccactaacc agatgatcca gcagcaggac tgagggtgcc tggggcaagc 49200 

accagcccat gccatcaccc tcacagagcc ctgggtatgc ttgaccattg agggccagga 49260 

ggctaactgt ctcctggaca ctggtgtggc cttctcagtc ttactctcct gtcccggaca 49320 

actggcctcc atatctgtca ctatcccagg acagccagtc actagatact tctcccagcc 49380 

actaagttgt gactggggaa ctttactgtt ttcacatgct tttctaattg tacctgaaag 49440

ccccactccc ttgttaggga gagacattct agcaaaagca ggggccatta tacacctgaa 49500 

cataggagaa ggaacaccca tttgttgtcc cctgctggag gaaggaatta atcctgaagt 49560 

ctgggcaaca gaaggacaat acggatgagc aaagaatgcc catcttgttc aagttaaact 49620

aaaggattct gcctcctttc cctaccaaag gcagtacccc cttagaccgg aggcccacca 49680

aggactccaa aagattgtta aggacctaaa agcccaaggc ctagtaaaag catgcagtag 49740 

cccctgcagt actccaactt tacgagtaca gaaacccaac agacagtgga ggttagtgca 49800 

agatctcagg attatcaatg aggccattgt ccctctatac ccagctgtac ctaatcctta 49860 

tattccgctt tcccaaatac tagaggaagc aaagtggttt acagtcctgg accttaagga 49920 

tgcctttttc tgcactccta tacatgctga ctctcaattc ttgtttgcct ttgaagatcc 49980 
ttcgaaccca acatctcaac tcacctggac tgttttaccc caaggattca gggatagccc 50040

ccatctattt ggccaggcat tagcccaaga cttgagccag ttctcatacc tggatattct 50100 

tgtcctttgg tatgcggatg atttactttt agccgcccgt tcagaaacct tgtgccatca 50160 

agccacccaa gtgctcttaa atttcctcgc cacctgtggc tacaaggttt ccaaaccaaa 50220 

ggctcagctc tgctcacagc agagggctat ttatccctaa atacttaggg ctaaaattat 50280 

ccaaaggcac cagggccctc agtgaggaat gtatccagcc tatactggct tatccttatc 50340 

ccaaaaccct aaaacaacta agaaggttcc ttggcataat aggcataaca ggcataacag 50400 

gtttctgctg aatatggatt cccaagtacg gcaaaatagc cagaccatta tatacactaa 50460 

ttaaggaaac tcagaaagcc aatacccatt tagtaagatg gacacctgaa gcagaggcag 50520 

ctttccaggc cgtaaagaac accctaaccc aagccccagt gttaagcttg ccagcggggc 50580 

aagacttttc tttctatgtc acagaaaaaa taggaatagc tctaggagtc cttacacagg 50640 

tccgagggac cagcttgcaa cccatggcat acctgagtaa ggaaattgat gtagtggcaa 50700 

agggttggcc tcattgttta cgggtagtgg cggcagtagc agtcttagta tctgaagcag 50760 

ttaaaataat acaaggaaga gatcttactg tgtgaacctc tcatgatgtg aaccgcatac 50820 

tcactgctaa agaagactgg tggctgtcag acaactgttt gcttaaatat caggctctat 50880 

tacttgaagg gccagtgctg tgactgcgca cttgtgcaac tcttaaccca gcgacatttc 50940 

ttccagacaa tgaagaaaag atagaacaga actgtcaaca agtaattgct caaacctacg 51000 

ccgcttgagg ggaccttcta gtggttccct tgactgatcc caacctcaac ttgtatactg 51060 

atgaaagttc ctttgtagaa aaaggacttc gaaaagcaga gtgtgtagtg gtcagtgata 51120 

atggaatact tgaaagtaat cctctgactc caggaactag tgctcagctg gcagaactaa 51180 

tagccctcac tcaggcacta gaattaggag aaggaaaaag ggcaaatata tatacagact 51240 

ctaagtatgc ttacctagtc ctccatgctc acgcagcaat atggagagaa agggaattcc 51300 

taacttctga gggaacccct atcaaacatc aggaagccat taagaaacta ttattggctg 51360 

cacagaaatc taaagaagtg gcagtcttac actgctgtaa gaaaggacag agaaataaaa 51420 

gggaaccgcc gagtggatat tgaagccgaa agagccacaa ggcgggaccc tccattagaa 51480

atgcttatag aagaaccgct agtatggggt aatcccctcc aagaaaccaa gccccagtac 51540

tcagaagaag aaatagaatg ggaaacctca tgaggacgta gtttcctcct caggatggct 51600 

agccaccaaa gaaggaaaaa tacttttgcc tgcagctaac caatggaaat tacttaaaac 51660 

ccttcactta ggcattgata gcacccatca gatggccaaa tcattattta ctggaccagg 51720 

ccttttcaaa actatgaagc agatagtcag agcctgtgaa gtgtgccaaa aaataatccc 51780 

ctgcacttca ggccatgcat ttcaatccct gaatctttaa cctccttgtt aagtttgtct 51840 

cttacagaat tgaagctgta aagctacaaa tggttcttca aatggatccc cagatgcagt 51900 

ctatgactca aatctaccgc ggacccttgg accggcctgc tagtccatgc ttcgatgttg 51960 

atgatatcaa aggcacccct cccgaggaaa tctcaagtgc atgaccctta gttgcaccag 52020 

ttcagcagga agcagttaga gcggccgttg gccaacctcc ccaatagtac ttgggttttc 52080 

ctgttgagag gggttgctga gagacaggac tagctggatt tcctaggccg actaagaatc 52140 

cctaagccta gctgggaagg tgactgcatc cacctttaaa cacggggctt gcaacgtagc 52200

tcacacccga ccaatgaggt agtaaagaga gctcactaaa atgctaatta ggcaaaaaca 52260 

ggaagtaaag aaatagccaa tcatctatca cctgagagca cagggggagg gacaatgatc 52320 

aggatataaa cccaggcgtt ctagccggca acggctaccc tctttgggta ccctcccttt 52380 

gtatgggagc tctgttttca ctctattaaa tcttgcaact gcacaaaaac caaaccaaac 52440

caaaccaaac aaacaaaaaa acagtgactg acttatggta aacattatat aagcataaag 52500 

taaaccaaat actttttttc taattataaa agtcctacac taacattgca gaaaacttga 52560 

ggaattcaga aaagttatta cttagtaaga gttggaatga ataaataagt gggtagttag 52620 

gatggcaggc atgtgtttta ggcagagaga tacaagataa agaactaaaa ctagaatctg 52680 

gtctttgaac ccctggcctg attgtcttat tcatcatgat gatttgccta tttttccaat 52740 

ttctaaatca ttcctctgct gttgacaaag caataaattg ttatatttga taagtgaatc 52800 

ttcagagaac tggccttgag ccagctctac aactaaccag ctctgtggcc ctttggagaa 52860 

tttcttaata tttgtaaacc tcagctttcc taccagtgaa atgaagttag tcctccctgt 52920 

cctgcagggt tgctgcaagg atttaacaac atgtatatgt acaaaccact tagtcctgtg 52980 

cttggcctat ttggtgcttt ttttttttct tttttttaag acagggtctt gcttgaatct 53040 

tgctgaggct ggattcaaac tccggggctc aagtgatcct cctgcctcag ccttccaagt 53100 

agctgggact acaggcctgc accactgtgc ctggtggcag tgctcgttga atgttctttt 53160 

tccttagttc cttcctagct cttctgacag ttttggggct tatgtatata agaaggactt 53220 

ggttgcctca gggagagagg atgcagtaga gttacatagc tcacctcaca tcctccaaaa 53280

gctgaattca taagtaaaca aagtgagcat ttcacccata ctttacacaa agtctagaat 53340 

atttatggtg tccatcaggc tcacatactg tgaccttctg agatactttt ccctctccat 53400

tccctttcct tctccctgct ggcttttttt tttcttcttc ttcttttttt ttttttttta 53460

ctgtgaaaaa caacctatat acagaatagt acaaaaacat acctgtatag tttgaagagt 53520 

aattattaac agtcttatta agaaacaatg ctccatccat gttactgcaa aagacatgac 53580 
cttattcttt ttcttcttaa tttttttctt tttctttctt tattttggcc ctttttcaga 53640

tctagacctg cagagatctt gttctttttt tgagacagca tctcgctctg tcaccaggct 53700 

ggagttcagt ggcgtgatct cggctcactg caccctctgc ctcctgggtt caagtgattc 53760 

ccctgcctca gcctcctgag tagctgggac tacaggcgcg tgccaccaca tccagctaat 53820 

ttttttattt ttagtagaga ctgggtttca tcatgttggc caggatggtc tcaatctctt 53880 

gacctggtga tctgcctgcc tcggccccca aaagtgctgg aattacaggc atgagccacc 53940

acgcccagcc gatcttgttc tttcttatga ctgtgtagta ttccatggta tatatgtacc 54000 

acattttctt tatgcattct atcattgatg ggcatctagg ttgattccat gtctgctatt 54060 

gtgaacagtg ctgcagtgaa cattcacgtg catgtgtctt tgtggtagac tgctttatat 54120 

tcctctgggt atatgctcag taataggatt gctggattga atggtagttc ttcttttagc 54180 

tctttgagga tactgctttc cacaatggtt gaactaactt acactcatac agtatataag 54240 

cattcctttt tctctacaac ctcgccagca tctgttactt tttgacttat aggtgggagc 54300 

taaatgataa gaacttatga atgtaaagaa ggaaacagac actgggatct acttgagtgg 54360 

gaagagtggg aagagggaga ggagcaaaaa agataactct tggttactaa gcttaatact 54420 

tggacaatgt aataatatgt acaacaaacc cctgtgacac atgtttatct atgtaacaaa 54480

ccttcacatg tacccctaaa cctaattttt tttaaaagaa acagaatgcc agccaggcat 54540 

agaggctaat tgcctctaat cccagcactt tgggagggtg agatgggcag atcacttgag 54600 

cccaggagtt taaggccagc ccaggcaaca tagcagaacc ccatctcttc aaaaagtaca 54660 

aaaattagct gggcatggtg gtgtgcacct gtagtcccag ccccttggga gactgagctg 54720 

ggaggatggc ttgagcccag gaggtcaagg ctgcagtgag ctgtgatcat gccactgcac 54780 

tccagcctgg gcgacacagc aagaccctgt ctccaaaaaa aaaaagaaaa ggaacagaat 54840

gttaccagcc caactgcact tcttcactcc cccaccactc taatgaagtc atcactaacc 54900 

cacttctaaa gtactcacat accctatgtc tatggaggta tgtcagtgga ggctaaggta 54960 

tgccagtgga ggcttatgta ccttatgtct aaatatttaa agttattaaa ttaaaaaacc 55020 

attaaaatat gctttctacc ttgacaaacc tttataacaa aattagaaaa tgtttaatgt 55080 

tatggcatta aataattgaa agcaaaatat caaagatgat agaatttaat taattatttt 55140 

attttatttt atttgagaga gggtctttct gtgtcaccaa ggctggagtg cagtgatgca 55200 

atcatggttc actgcaacct caacttcccg ggctccagtg atcctcccgc ctcagcctcc 55260 

caagtggttg ggactacaga catgtgccac caaatccagc taatttttaa attgttttta 55320 

atagaggtaa gggtctcact atgttgccta ggccagtctc gaattccagg ggctcaaggg 55380

atccttttgc cttgtcctcc cagagtgctc ggatttaagt tgggagccac tatacccacc 55440 

caacataatt caattattta atatttcaca tgttttagta ttcctttgat agggatgtga 55500 

tgtttgggtg aataataaag taaatcaaag acatatattt gaaaattatg tagttattct 55560 

aaaaaattaa ttatttacct ttattttagc aaaatcagtg tgttagcata atcaagatat 55620 

tttggtattc tagtaacaag atctagtcac agtaatgatg taaagattaa aaaataaaat 55680 

ataataggaa ccagtataaa caagtggaat ttaaatttaa aatgcaatac cagctgggtg 55740 

cgatgcctca cgcctgtaat cccagcactt tgggaggcca aggcaggcgg atcacctggg 55800 

gtcaggagtt caagaccagc ctgaccaaca tggaaaaacc ccatctctcc caaaaataca 55860 

aaataagttg ggtgtggtgg tgcatgcctg taatcccagc tactcgggag gcggaggcag 55920

gagaatcact tgaaccaggg aggcagaggt tgtgatgagc cgagatcaca ccattgcact 55980

ccagtttggg caacaagggc aaaactctgt ctcaaaaaca aaaagaaaca aaaaacacag 56040 

taccatttac attagcaccc ctcaaaatga aatacttagg tataaatcca gcaaaatagg 56100 

tataagagat atataagtaa aactataaag ctctgatgaa agaaataaaa gaaccaaata 56160 

aatggacaga tattccatgt tcatggatag gaagactcag taatgtcaag atgtcagtcc 56220 

ttttcatctt tatctataga ttcagtacac ttccaatcaa aatccccgct agttattttg 56280 

tggatattga caaactgatt ctaaagttta tgtggacagg caaaagaccc aaaataggtg 56340 

acatgctatt gaagcagaat aaagttaaag tctctacttc ttcttcaata tcattttctt 56400 

caatatcaaa tcaactcatt ttcttcaata tcaagtcaac tgtagactga cagtgtccta 56460 

cttcaagact tactgtaaag ctacagtaat cagaacagta tggtattggt gaaagaatag 56520 

acaaacagat cgatggaaca gaatagagag cccagaaata gacccagaca aatacagtta 56580 

atctttgaca aaggagcaaa ggcaatagaa tggaggaaag ataccctttt caacaaatgc 56640

tgttggaata actggatgtc cacatgtaaa aaaatgaatc tagacacaga ccttacaccc 56700 

ttcacaaaaa tgaacacaaa atggatcatc aacctagaca tgaaacacaa aactatacaa 56760 

atttctgctc tgtgaaagac aatgtcaaga gaatgagaag acttggaaaa tatttgcaaa 56820 

agacacatct gataaatgat tcttatctaa aatatatagg aactcttaaa actgaacaat 56880 

aaaaaagaaa cctgatttta aaatgtgcca aacactctaa tagatatctc atcaaagaag 56940 

atatatagat ggcaaataag catatgcaaa gatgctccac atcataggtc atcagaaaaa 57000 

tgcaaattaa aatgatgaga ttctactaca cacttattag aatggccaaa atccaaaaca 57060 

ctgacaacac caagtgctgg tgaggatgtg ggacaacagg aactttcatt tattgctggt 57120 

gggaatgcaa aatggtatag ccactttgga agacagtttg cagtttctta caaaattaaa 57180 
cacactcatc atatgatcca acaattgcag tccttgatat ttacccaaag gagttgaaaa 57240

cttatattca cacagaaacc tgcacatagt tgcttacagc agctttattc ataattgcca 57300

aaactttgaa gcaaccaaga tgtccttcaa taggtaaatg aataaataaa ctgtagcata 57360

tccagaaatg gagtactcag tgctaaaaag gaatgaggta tcaagctatg aaaagacatg 57420

gaggaacctt tatattttta tttttttaga gacagggtct tgctctgtca tccaggctgg 57480

aatgcagtgg cacaattatg gctcactgtt gccttgacct cctgggcttg agctctcttc 57540

ctgcctcagc ctcccaagta gctgggacta caggtgcatg tcaccacacc tggctaattt 57600

ttttttttga gagattgggt cttgctgtgt tgcccaggct ggttttgaac tcctgggctc 57660

aagtgatctt cctgcctcag cctcccgaag tgctgggatt ataagtgtga gccactatgc 57720

ctgacttttt tttaaattta tttttcttct agagacaagg ccttgctttt tattgcccag 57780

gctggagtgt agtgatgcag tcatagctca ctgtggcctc aagatcctgg gctcaagtag 57840

gagcccagct aatttttttt aaaatttttg tagagatgga gccttgcaat gtggcccatg 57900

ctatggagga aacttaaatg catatttcta agtgaagaag ccagtctgaa aatgttatat 57960

accatgggat ttcaactata agacattctg gaaatggcaa aacgaaggca acaataaaaa 58020

gatgaattgt cagggagttg gtcaggggag aatgaatagg tggagtacag aagattttta 58080

gggcagtgaa atgtctctga tacagtaatg gtggacacac gttgtatatt tgtccaaatc 58140

catagattct ataataccaa cagtgaacct taatataaac tatggacttt ggggctgggc 58200

atggtgactc atgtctgtaa tcccaacact ttgggaggtc aagatgggag gatcacttga 58260

ggccaggagt ttgaaaccag cctggtcagc attgtgaggc cctatctcta caaaaataag 58320

gaaaccatgg actttggatt ataatgatgt ttcagtgtag gttcctcagt tataacaaat 58380

gtactactct ggtgggggat gtttataata aaggaggcaa tgcatgtgtt ggggcaggag 58440

gtatacgaga aatctgttta ccttcctcta aattttactg tgaatccaaa actgctctta 58500

aaaaaaaagg tcttaaaaaa taaaattata tttgagggaa aatatttgaa ttattattat 58560

tattttcttt ttgagacgga gtctctctct gtcccccagg ctggagtgca gtggcacgat 58620

ttcggctcac tgcaagctct gcctcccggg ttcacgccat tctcctgcct cagcctccct 58680

agtagctggg actacaggcg cacgctgcta tgcccggcta attttttgta tttttagtag 58740

agatggggtt tcaccgtgtt agccaggatg gtctcgatct cctgacctcg tgatctgccc 58800

gcctcagcct cccaaagtgc tgggattaca ggcatgagcc accgcgcctg gcctatttga 58860

atgattttta tcaaagatgt aaattaaaac aaatgtaaaa ataaaaaaca aatcactgtc 58920

tgattctatt tgtataaatg tctagaaaat gcaaactaac ttataatggc aaactactct 58980

atagatcagc agttgcctgg aggcaagagg gaagaattgc agtgaggtat gataaaactt 59040

ttggggtaat aaatatattc attatcttga ttgcagtgat gttggcacag gtacatccat 59100

atgtcaagat ttcttgttga atactttatg tagtttattg cataacaatt ctataaaatt 59160

aaaaatcata aaattttgtt tgttttaaaa acattctttc tttctttttt cctgagacgg 59220

agtctccctg tatcacctag actggagtgc tgattgcagc cttgacctcc taggctcaag 59280

tgatcctcca gcctcagctt cctaagtagc tggtaccaca caggcgcatg ccaccacacc 59340

catataattt ttaaattagt ttttgtagaa atagggtctt gccatgttgc ccaggctggt 59400

cttgaactcc tgggctcaag caatcctcca tcttggcctc ccaaagtgct gaaattacag 59460

gtgtgagcca ctgcacctgg ccatcttaat ttttaatatt taaaagaaaa gtaagggcca 59520

gacactgtgg ctctcacctg taatcccagc ataagaccag cctgggtaac atggcaagac 59580

cccatctcta tcaaaaattg aaaaattaac tgggcatggt ggtggcctgt ggtcccagct 59640

actcaggagg ctgtagctgg aggatcactt gagcctagca cgttgaggct gcaataagcc 59700

atgtttgcat cactgcactc cagcttagat gagagagtga gaccctgtct caaaataaat 59760

agatagataa tatatgtcgt agttttaaaa atatattatt aagataaaaa gcaagccaag 59820

acaattaagt gggggaagaa tagtttttcc aacaaatggt gctggaacaa ctgcatagcc 59880

acaggaaaaa gaatgaagtt agatccccta cctcacacca tataaaaaaa taactcaaaa 59940

tggattaaag acctaagtat aagctgagac aagaagatta cttgaggctt ggagttcaag 60000

accagcctgg gcaacatatt gagacctcgt ctcttaaaaa agaaaaaaaa tcagccgggc 60060

atggcagtgt gtacctgtat tcctagctac tcagaaggct gaggccagag gattgcttga 60120

gcccaggatt tagaggctgc agtgagctat gattgcacca ctgtactcca gcctgggtga 60180

cagagtgaga ccttgtctgc tccacccctc ctccacaaag tgtaaaggta taaatgtttg 60240

tggccttgga ttaggcaatg gtttattaaa tatgacatta aaagcacaaa caacaacaaa 60300

atagattaat tggacttcat caaaattaaa acctctgtgc ttcaaagggc acaccaagaa 60360

agtgaaaaga gaatccacac aatgggagat aattttttgc aaatcatgta ttttacaaga 60420

ctggtgtcca gaatatataa agaacacttg caactcagca ataaaaagac aagtaacaca 60480

atttaaaaat gttgaaagga tttgaataga catttcttca aagaagacat ataaatcacc 60540

aatgagcata tgaaaatgta ctcaacctca ttggtcatta gagaaatgca aatagaagtc 60600

acacccatta ggatggctaa aataaaaaaa gatgaacaat aacaaatgtt ggcaagtatg 60660

tggaaaaatt agaaccctca tacactgtgg atgggaatgt aaaatggtgc agacactttg 60720

gaaagttggc tattcctcag agatttacca catggcacag caattctact tttaggtgta 60780

tacccaagac aattaaaaag atatatacag gccgggcgcg gtggctcaag cctgtaatcc 60840

cagcactttg gccaaggtgg gtggatcacg aggtcaggag atcgagacca tcctggctaa 60900 

tacagtgaaa ccccatctct actaaaaata caaaaaatta gctgggcgtg gtgggggggc 60960 

gcctgtagtc ccagctactc gggaggctga ggcaggagaa tgtcgtgaac ccggcaggcg 61020 

gggcttgcag tgagccgaga ttgcgccact gcactccagc ctgggcaaca gagcgagact 61080 

ccgtctcaaa aaaaaaaaag atatatacac agaaaaactt gtacatagat gttcatagta 61140 

gtattccaat aaacatgccc atcagtagat gaatggataa gcaaaacgtg gtgtattaat 61200 

aaatgaaata ttatccagcc acaaaaagca atgaagtact gatacatgat ccaatatgga 61260 

tggaccttga aaactatact aaatgaaaga aaccagccac aaaaggccac atagtacatg 61320

attacatttg tataaaatgt ccagaattag caattccata aagacagaaa gtagattagt 61380 

agttgccaag ggctgaggga aggaggaatg ggagtgactg ctaatgggta cagggtttct 61440

ttttggggtg agaaaagtgt tctggaatta cataatgatg atagttgtac aactttgtga 61500

atatactaag acacactgaa ttgtatcttt taaaaagtta aattttatgg tatgtgaatg 61560 

atacctcatt aaaatagtta catgagaaaa aaatcaaagg caaaatacag agtataattc 61620 

aagtatttta atttttaaat ataaagtatt tatagccaaa tttgatttac ttctaaaatg 61680 

tcttattaaa tagtttaata aaagcaaaac tgttctagca ttcagtgttt attaatttgc 61740 

aatacacaaa caatatcatg ttttactcat gttgggtcca ccacatatgt atatatttaa 61800 

gaataatgtg attggtcagt actgcaaaat gttttcgtgt acggtggctg tgagtaccat 61860 

actaattagt acactaagaa ctatgaattg gaacaggaag aaaagcaaga aaatgaacat 61920 

tcagcactat ttggaaatga aacctcacta tgcaagaatt cattgcattc atgccttctc 61980 

agggggagtg tttgataaat taatatttca ctaacctaag agcttctgct gctcaaatcc 62040 

tctacacaca ctaatgcagt gtcagtctct gagtttggca gtggcacagc cataacttta 62100 

tggcagttag gtgaaatccc tttttgtttt gtggacatag aacaacagat gtggagttgt 62160 

acttctctgg ggccagcaca aacgccatta agtctgtgag tctcattctg tcagtgctga 62220 

gcatccatgt gttctttgat ttgcgagtgt gtgtttgtga ttggcaggtt tctcttacgt 62280

gactctgaag gagtgttgct gatcacatca cacccatcct taggccttat tgatcatcag 62340

tgtaccttcc cactactata ctttaaatag atgcctgtta tttaaatttg attttgagat 62400 

taaacagaat ggcagagaca attttgagac acatctttct ttcatgctct ggtaggaaga 62460 

tcaagatttc taggacagta gaacagagta aaagatgagt gctgttggga tcctatcttt 62520 

ctcctaagct attttttctc tctcagttat tcatcatcta tctcaatttt tcctaatgaa 62580 

ctcttctata taaaagagga tccaggtccc acatccactg tcaaggagga atgtaagatt 62640 

gacttgcaac tcagcctgtg tacgcagttt atggtttttc ttggcaggtt taatgttctt 62700 

tcttatctct taacttcttg ctattcaata gtaagtaact ccctggcaga attacctgtg 62760 

gctagagaat gctgttatca gagcatcttt gtttaattgg tacttagaac agaaggtgtc 62820 

acctatttga caggccaaca attatgagca aggaggcatt tgattcatca agatagaaat 62880 

ctgcctgtta ggtggaaaca tgtctatgtg ggttgatatg tttttagaat attaaggctt 62940

gtttgtgcat gacaacttta ggaaggtgta ctccaaatgt ctccaaaggt ttgctgtagt 63000 

tctttacaga agttgggctg ctcctggtgg acagtgtgta acagtgaaca atgtatgctc 63060 

tagactgggt tcccttcctc caccctgtgt ctgtgtggcc ttgggcaagt tgtttaacca 63120 

accacttttt gcctcagttt ctttatcgga acaaggagaa taagaatact tcaatcaggc 63180 

caggcgtggt gactcacgcc tgtaatccca gcctttggga ggccgaggtg ggtggatcac 63240 

ctgaggtcag gagttccaga ccagcctggc caacatggtg aaaccccatc tctcctttac 63300 

ttatgctggc ctgatattga tcgtccatgg tagaattgat actgcttgac aaagcagcct 63360 

atttcagtca ggacccttct tctctagttt cctctgtagc tattacctta gccttccatt 63420 

tcattcttca cactacagat actctcattg ataaaggaat gatgtcttta tgctttcaag 63480 

cattctggca agttagtaat tcaactatga ttctaggtca gacaaaacca gttatgaaca 63540 

taagactgtt tttaatctcc tccctggtcc cccaaccacc caccccaatc aggagaaact 63600 

atgttctgca ttggtttaag gaacccgctt ctttctttga tacctgacct acagatccaa 63660 

tctattccca ggaattttga taagaattct caaatcctca gcaaggctat gccactgtca 63720 

tgactctcct attcctggta gtgtcattct cagtgtaggc tgtttgatag gtagttttgt 63780 

gaagtcttgt tcatcataat ggatcatatg atttttaaaa gcaggacctg ggtcatatgc 63840

ccagattaat ttcaacaaag ttggtatgtt ttcttcctaa aattaatttt ttatgattat 63900

caaagtttta tatgcataca gtataaaata tcagatttct ataaggctta taataaagag 63960 

tagcattcct ctgccctatt cccctctcat actgggcacc ttactccaaa atcaactgct 64020 

ttcagctctt ttagccattt cttatggtta tcttcatatt tcaaaacagc atgcttatat 64080 

ggttgtattt tgaattttca aacttagatt tttatctaca gacttcctta tggaagataa 64140 

atatttaact ctttttgtgc tactcctttt ctcatctctt aatgtaggct atgttaaaat 64200 

ttttggttaa atcaattaaa agccagtata gtatagtggc ttaagagtga gggtgctcac 64260 

ctcaaattcc agcctcacca cccattactt tgtgtgactt tgagcaagct ttaaaacgtc 64320 

agtgtctcag ttttgtcaac tgagtagata cctcatagaa ttgctgttga tattaagtga 64380 
cttaatccta tgggctgaat ttgtctccca aagttcattt gttggaaact taatccccag 64440

tgcatgtgtt aagaagtggg acctttcaga gttgaatagg ctatgagggc tctgccttca 64500 

tgaatggatt aatgccgttg ttgcagtagt gggttcctta taaaaggaag tgtttgaccc 64560 

cctttccttg cctcctcatg catgtgatgg ccttagccat gttataatgc agtagtaacg 64620 

cccttaccag acactggctc cttgatcttg gacttctcag cctccagaac tgtaagaaat 64680 

aaaagttttt tctttataaa ttacccagtc tctggtattc tgttatggca gcaaaaaaca 64740 

gactgagaca cttaatatat atgaagcatc tagactgtct ggcacattgt acattttaaa 64800

tcccagatat cgatatcatc aatatcatca tcatcatcat ctgtggctgt ataatacctc 64860

cctctgcatt taaaggatga gggctggtgt agcagttatt aatataagtg agctagttgg 64920

ctataaacct ctcctatagg ctttgccata aacttgtgtc atggataatc agagaatttg 64980

gacctcctaa tgacaggcca ctgacaagaa aagcccagag ggagctgatt gagcatgctc 65040 

agttctttct accaaaggcc tcaatcagac agattctctt tcccaggagt agacactgag 65100 

ggggtggaaa caggctctga gttgactcca ctgagaagtc cctgaatgga tagcagccag 65160 

ggaattaagg agtttccctg tggcaagtct ctactcgtac aatattagga cagctgtttt 65220 

tttaatttgt tctgtgggtg catatttttg atctccacaa cataactact tactcactgt 65280 

gttgcttttt gaaatttata acatctaaat ttgcaaatgt gaggtatgct ctcaggaaaa 65340 

ttatatatct gttaaaattc tttgctcctt ttaggacact tccatccatc tggtgacctc 65400 

tagtagcgta catactcagc aacaattaaa ggtagtttct ggcttgtttt tcccaaacaa 65460 

taatttgttc aaaatacaga aatttggaaa gtaccccatg atgagaaact tttgaaagag 65520 

aatttaatgt acagtaacta tttttttcct ctgaggaata attttgggaa aagaaatttg 65580 

ctttatattc aggcatattg agtaagttgc cttcattttc aaacaaattc atcacctctc 65640 

ctttccccat tccccaaact tgttcatttt cttcattctt aaactctgca tctgtcttcc 65700 

cctcacttta ccacgtagct aaatctctta agagccaacc tattgccact cccttcatct 65760 

tttttaatct aatggatcat aagtcctgtt gatttcatca ctcacatctc tcaaatcggt 65820 

ctccttggat atcttgctac ttctgatttc agacaattgt catttgtcac ttgcagcagc 65880 

cttccagctg ccttcctccc aatctcccct atgagcattc tgttcttcaa actgttgcca 65940 

aagaggaatt tctaaaatac cgtttgatca tgtaattcca ggttttaaaa acctaatgtt 66000 

gactctctga catttaaaga ataaagtccg tatttcttgg gatgactgac atggctttgc 66060 

tgtgattctg gctacatcct tagagaatca cttgcagata ttaattttgc atatgctatt 66120 

ccctttacat ggaatgttcc tgcctcctct ttcccccatc ctttgcctga tgaacgccta 66180 

ttcatcctcc atgtcacttt gggtgttacc ctgaggaaac agtggttatt ttttccctgc 66240 

ccatctgggc tggatacccc tcctaagttg tcccacaaca cacagtacac accttggtcg 66300 

attatcacac tgtttgttgt tctgccgttg tttgtgtctc ttgaatatga gtttcttgag 66360 

attagggacc tgagatcccc agtgcccagc agagcaggac ctgtttccac cccctcagta 66420 

actactcctg gggaagagaa cctacaaata aatgtgtatt gcatgaataa agctatatat 66480 

ccccttcctg ttcatactta tccatttaat ttctgaactc taaatgctgt atttttctcc 66540 

attaaatttg gttttattgg tttcagtttc ctctttatat ggattttgag tcctaatttt 66600 

gtcaaccagc atatcaacag ttctttctag cttttgtccc ccggtagaat ggcagctcca 66660 

tgaagacagg gattagtgtc tgttttgttc actgctgttt tctcagcatc tagaatagtg 66720 

ctggcacata tagatactca caaagtattt gttgaatgaa tgaaggtgtt gcatacaaat 66780 

ttgataaata aaacttaggt tttctttcaa atattaatcc ttagctccat tctgttcaat 66840 

atttttatta gagataacct ttaaaaatct tccttgtcac ataagagtta attccatgaa 66900 

tcttacagaa caaggctata ctgagaaatt caaggaacac cttaatgaat cagggttatt 66960 

tcattgtgag agagtataga aatggttgat agtatgctgt aagtaatttt actttacgtg 67020 

taagtacttt tctctggctt tccagaacat gctgttggag taagagagga atgccttatt 67080 

gtggaccgag gggatagatt tggatacagc cttctttgag gaaggtagag aggtcaacta 67140 

ttacatatcc agcagtaact tccctttcaa agactaagtg ttctcattca atcatttgat 67200 

actttttgtg catctactac aatttgaaga atagaagagg gaaatgcatg tgtaaggcat 67260 

ggtggcaaca tttaagaaac ccagatttgg ggatacaagg tgtgtgtgtg cacatgcatg 67320 

tgtgtgcaat ttaaatgcac agggtaaaga actttgagtc agtgacaatg agtggcaatg 67380 

gtggtaatta agtgctgggg aagatcaagg gagagagagg gtgctgagag ccaatagggt 67440 

ggaacatgtt tgaaagagct gggttctgaa gtattcctcc atagaagggc attttaaata 67500 

gcttttttgc gtccttattc tgttaagcat tattaaattg ttctcccatc tttaaaaggt 67560 

tccctagctt aggtgcactg gcaagaaatt ataaaagcag catgaccagg cgtggtggct 67620 

catgcctgta atcgcagcac tttgggaggc caaggcggga ggatttcttg agttcaggag 67680 

tttgagacca gcctgggcaa catggtgaaa ccctgcctct acaaaaaata caaaaattag 67740 

ctgggagcag tggcacgtgc ctctagcccc agctactcag gaggctaagg tgggaggttg 67800 

gcttgagcct gggaggtgga ggttgcagta aactgagatt gtgccactac acacagcctg 67860 

ggcaacagag cgagaccctg tctcaaaaaa aaaaaattaa attaaaacaa taaaagcagt 67920 

gtggttgctt ataaggagtg atggagtgga cagaggaggt ttttgggcta gtcaggtaag 67980 






gctggggtat gaatctagaa gtttccattt aaatagcagg gagcccttag gcaaatcact 68040

taacttctga gctttgcctg tttcacctga ggttgcttca aagattgaat gaaaccgtat 68100

atataaagtg ctcctaaaca tcatctgcca tgtggcaggt tctcaagaaa tgttagtttc 68160

cctttctcct tacaaagata agatgcttgc tttgagtata tttttaggct tcctgatcat 68220

tattggttat gattttaaat cttggtccag ccaccaccca tatggtttct gcagttaaca 68280

aaagaggcaa aggttcacta ctgggggaaa agaggtgtac agaaatgggt gtagagaaag 68340

tagatgtttg aaggggatct aattaggaaa gtatttttcc tggtggtcca gaatttaaaa 68400

ttatagagtc ttatgagaaa atgataatat tcaggttaag aacagtttat tatttcctct 68460

ctatattgga gaatattttc attatcttat ataagaacct tgtaaaaatt tttcttctta 68520

actagcttcc cactataggc cattaatcct gttattattt caaagattaa acaactatag 68580

taataacagg attatatgtg cattaattta tattacttca ttccgcaaag gatttgaggt 68640

agtttttaga agcataaaac actgcacaaa taaaatagta ggataggagt agaaaataaa 68700

atttcagcaa ccatgaaaaa tatgacatag tatatgttgt taagactggg ggaatgtaaa 68760

cacatcacca gtcagggcct atatagttgt tacagtccca tagcaaattt gactctaagc 68820

ttctggcaaa caacatgaaa agggaagcat gatggatgtg gttaaataag acacaattta 68880

cttgttactt tactcaagga agcaagtatt ttttgcccct ttgtttctta taggagatgc 68940

tgggtaatga agtaatgttg gcattggcct tatagtggag gtaataatgg attttatcag 69000

gcagtttctt taagcatctc ttgatgaaag atgaggctat gacatcaaga gacaattctg 69060

aggccgggag cagtggctca cacctataat cccagcactt tgggaagctg aggcgggcag 69120

atcacttgag gtcaggagtt cgagaccagc ctggccaaca tggaaacctc gtctctactg 69180

aaaatacaaa aatcagaaac cctgtcttta ctaaaaatac aaaaattagc tgggcgtggt 69240

ggcagtgcct gcaatctcag ctacttggga ggctgaggca ggagaattgc ttgaacccag 69300

taggtggagg ttgcagtgaa ctgaaatcac accactgcac tccagcttgg gtgacagaac 69360

aagactccat ctcaaaaaaa aaaaaaaaaa aaaaagacaa ttctgaaagt ggtgagtcgt 69420

tctaccaggg gccaggatga tctcatctgg gttatggatt ctaggtctgg cctcatctaa 69480

ggtgacacac agagggtaca ccgcacaatc ttcctatctt ctttaaatat cactgctttc 69540

aacaggactt tttttttttt tttttgaaac aggatcttgc cctgccctgt cacccaggtt 69600

ggagggcagt ggcatgatca tggctcactg cagccttgac ctcccagctc agccttctga 69660

gtagctggga caacaggtgc atgccaccgt gactggctaa ttttaaaaag ttgtttcgtt 69720

tttttttttc tgtagagaca gggtctccct agattgtcca ggctggtctc aaactcctgg 69780

cctcaagcag ctctccagtc tttgcctctc aaagtgttgg gattacagac gtgagccacc 69840

acgcctggcc caaaaaacat aataatgtgg ttattcgaag aagtgatttc ctctcaaaac 69900

ataaattcat ttcttctttt actcttgtaa ctttctgagg tgaaaagtag gaagttccca 69960

gatttttcat tgctgtaaag aaagattata gacaagcaag gaaggagtta gaataacctg 70020

tgtgataatg aattagaaga gttggaggtg tatgtaagta tgctcagcat gaatttatgt 70080

ttagcttaat gtagatacag atggttacat gtggaaagta tttatagcta tgcatagata 70140

ggttggtata tgtacatgta ttttctacct cctttggcta agagggcgca gaagccatga 70200

catccctgtt gcaacaaaaa cacctagcac catttatctt ggtttgtaat actattctcc 70260

agtaaaaaca accagggctc cttgcagaaa gggctgatga taatatataa gattagcctg 70320

gagcatctta tatatcagaa agcaagggag tactcaaaaa ctaaaaacaa taaactgctc 70380

cccaataatg ggagtatgtc aaagggtcac aggagccatg tgaaagagtt tgcaatagcc 70440

aacaaaatga agaagtattt gaatttaaat cagagtataa aataaatatc tacgagtcca 70500

taatgatata aacaagtgat tgaataaata aataaatgtg ggagactaga caaatccccc 70560

atgcagaaga attccaagta atttatgtag gtaaatactt cactgtcaaa gaggcagagc 70620

ataattaatt ccccactcct gtaagtgtgg gctgcttagt gacttccttc caggagtgca 70680

gtataaatag ggaaaagaag acaacttcac agtggaggaa tctggcaaac tctgtcccac 70740

ccagatgatc aaggtttaca tcaaaagcac taagccatgt tgatactgca ctgtgtttcc 70800

ttgatataat gggatgaaat ggcactttgt gtagtcgtgc ttccaaaaaa cctcatagcc 70860

ctagactaat catgggaaga acaccagaca aatcctaatt gagggacatg ctacttaatt 70920

cctgaaaagt actcctcaat gctgtcaagg tcatctgaaa caagaaaagt ctggcaaact 70980

gtcacagtca agaggagact aaggagatat gacgactaaa tataatgtgg tatcttgtat 71040

gggatcttgg aatagaaaaa ggatattagt taaaactgag gaaatctgag taaaatatag 71100

atgtttgtta ctaataatgt atcaatagta gttcattaat tgtgacaaat gtaacatact 71160

atcgtaagat gttaataata gaggaaactg gatgtgaggt atatgggcac tccctgtact 71220

gttcacaatt gttatgtaaa tctgaaacta ttctaaaatg aaagtgtatt ttattttatt 71280

ttattttgag acggagtctt gttctgttgc ccaggctgta gtgcagtggt gaaatctcgg 71340

ctcactgcaa cctccacctc ctggtctcaa gcgattctcc tgcctcagcc tcctaagtag 71400

ctgggattac aggcacgcgc caccacaccc agctaatttt tatattttta gtagagatgg 71460

ggtttcacca tgttggccaa gctggtcttg aactcctgac ctcaggtgat ccacctgcct 71520

cggcctccca aagtgctgag attacaggcg tgagccaccg cgcccagaca aaagtttatt 71580

ttttaaaagg taggaaagtt tcacattttg atcgtactat tgagataaaa cttggttgtt 71640

gttgttgttt ttgagatgga gtcttgtact gtcacctgga ctggagtgca atggcatgat 71700 

cttggctcac tgctacctcc acttcccggg ttcaagcgat tctcctgcct cagcctcctg 71760 

agtagctggg actacaggca cctgccacta cgcccagcta attttttgta tttttagtag 71820 

agatgggttt cattatgttg gccagactgg tctctaactc ctgacctcgt gatccgccta 71880 

ccttggcctc ccgaagtgct gggattacag gcatgagcca ccgtgcctgg ccaaacttgg 71940 

tgttttcatc tgagaatgag gtttccaaga attgaagata caagttagct aaatagtatc 72000 

agtgaaacca cagagtataa cttcagacca cttgtggttt taaggcagat ctatgccaca 72060 

gagaagtatt tgaatttaaa tcaaagtata aaataaatat ctatgagtcc ataatgatat 72120 

aaacaagtga ttgaataaat acataaatgt gggagactag acaaatctcc catgcagaag 72180 

aattccaaat agtttatgta ggtaaatatt tcagagtatc agtactatga tccttcttac 72240 

aatgctatat ccgactaaaa aaaaaaaaaa gaaaaaaaga aaaagaactg tgatcctagt 72300 

tgcgaatgtt gacttatagc cagaaaagat gactttctta tttgaaagca tcatgaaatg 72360 

ctatcagcat gcagaaatga gagggcataa cagagagcta ttctatcgta tttttattta 72420 

atgttttaaa agagctatta ataatataat gtgaagagaa gttttaattt cagaggaatg 72480 

ggatgtttgg ttttaagatt atttttgcaa aatgtagtct gtcattttta aagtgaactt 72540 

tataatgaaa agagtttgaa attatttccc tattattaat atttatttat ttttattact 72600 

tttctgccat gacaattaca gtgcacattt tccctatttt tagctttttt ctggtaaagt 72660

gattttacct cctgtagatc gtcagtcccc agactttctg gcactaggga ctggtttcat 72720

ggaggacaat tttttgacaa attggagggg gaatgatttc aggataaaac tgttctatct 72780 

cagatcatca ggcattagat tcttacaagg aatgtgcaac ctagatcccc tgcatgcaca 72840 

gttcacaata gggtttgtgc tcctatgaga atctaatgct gtgctgatcc gacaggaggt 72900 

ggagctcagg tggtaatgct cgctcaccta ccactcacct cctgctgtgt ggcctggttc 72960 

ctaacaggct cgggggctgg ggacccctgc tgtaaataac ctttcgaaga tctgaaaatt 73020 

aactttagta tttttttgta tttactcgat attttaaaca aacaaaaatc tagagaatga 73080 

cataacaaat gtattttctg cgtatccact actctttact cagatcctaa cattctcagt 73140 

tttgaaaaga tgtaggctgg tcacagtggc tcatgcctgt aatgtcaaca ctttaggagg 73200 

ccaaggagga cagatcacgt gaatccagga gttagagacc agcctgagct ataaggcaaa 73260 

accctgtctc tacaaaaatt agccaggtgt ggtggcctgc acctgtagtt ctagctactc 73320 

gggagcctga ggtgggaggg tcaattgacc agaggctata gtgagctgtg atcctgccac 73380 

tacactccag tctaggtggc aaagcaagac cctgactcaa aaaaaaaaaa aaaaaaaaaa 73440 

aaaaaaaaag atgtaaacat tacaggccca gctgtggtcc cgtgtacact tctccattcc 73500 

taatcccctt catgcacact ttcccacagg taatcactat cccacatttg ctttaattat 73560 

tcccatcatg ttttatgcta tgactacaaa tgtgtgaatc tatattccac tgtatgttaa 73620 

taccaacatt ttttcattta ttttaccgct gatgggcaat ttaggttatt tccagatttt 73680 

tactattaca gatgcttctg tgaacatcct tactcatttc tccttgtgca catgtgcaaa 73740 

atttctctgg agtacatacc tgggagtgga attactagaa tgtggtgcat gtacatgttg 73800

ttgtatttat cagtagttca ctcttattta ttgctgtgta ttccattaca tagatatacc 73860 

ataatttgtt tatctaatca cttatttata gacatttggg ttctttctgt ttttttgact 73920 

attagaaata aagcagctat aaacatttgt atataagttt ttgtataggc atatgctttg 73980

gtttctctta ggctcagggg ctgggaaccc cctaggaatg gaggaatagc tgggtcatat 74040

ggtaggttta tgtttaactt tttaagaaat tcctgaactg ctttccaaag tgtttgtact 74100 

attttatatt cccagtagca gtatgtgaaa gttcttgttg ctctacatcc tcaccaacac 74160 

ttggtatgat tagtcatttt tagatatcct gtgatgtgtg tggtttttca ttgttttaat 74220 

tgacagttcc ctgatgacta atgatattgg gtgtcatagg catattttcc atttatatat 74280 

cttttttgta aagttcaaat tttttgccca tttttaaaat tgagttgctt ggtttttatt 74340 

gagttctgag aattttgtat gtaatttata tacaagttct tactagatat gtaatttgca 74400 

aatcttctct ctcagtttgt ggcttgtctt tttattctct tagcagtgtc tttcaaagag 74460 

aagttcttag ttttgatgaa gtttagttta tcaacatttt cttgtactga tttcttgtat 74520 

gggcatttag gtttttcatg ttatatctaa gaaatctttg cctaaaccaa gatcacaaag 74580 

attttctccg gtattttctt cttttgtttg agacagagtc tcactctgtt gcccaggctg 74640 

gagtgcagta gtgcagtctc agctcactgc aacctccacc tcctgggttc aagcgattct 74700 

tgtgcttcag cctcctgaat agctggtatt acaggcatgt gccaccatgc tcagctaatt 74760 

tttgtatttt tttaagtaga gacagggttt caccatgttg ggaaggctgg tcttcaattc 74820 

ctaacctcag gtgatctgcc caactcagcc tcccaaagtg ctggtgacac aggattttgc 74880 

tcagctactt tgccaaccag ggactcactc ggccatgggc caaggcaccc cgctcacttg 74940 

gtccacctgt gctatagctt ctactgacgt tcagcggttc ccgagctctt gtcatgcatc 75000 

taaaaagaag gaggatatgc tgacaatttg aagtgtaagg atgggtggag aagaatttta 75060 

ctgagttatg gaacaactct cagcattaag gggacacggg gtgctccctc acccccacag 75120 

tcaggtggtt tttctctctc tctctctgtg tctgggtctg gggcttttta tggactcaga 75180 






atggggagtg tgtacagatt ggtttgtgag tatgcaataa aagttaaagt gaagacaata 75240

ttcaaaggtg ggcgcgatag tgtagaaaac caattaggaa agggtaggta tgtagcctgg 75300

catggtggct cgtgcctgta atcccagcac tttgggatgc caaggcaggt ggatcacctg 75360

agatcaggag tttgaaacaa gcctgaccaa catggtgaaa ccccgtctct actaaaaata 75420

caaaaattag cagggtgtgg tggcacacac ctgtagtccc aactacttgg tagactgagg 75480

cagaagaatt gctggaacct gggaggcaga ggttgcagtg agccaagatt gcaccatggc 75540

acttctgcct ggctgacata gcgagacagt gtctcaaaaa aaaaaaaaaa aaaaaaaaaa 75600

agaaagggta ggtatgtgta aaataggtgg agggtgggga tcaatcagag gaaagcatgc 75660

caaatgggaa gacaggttct caatccagtc catggatttg cctgggactt gtagctaggc 75720

tttaaagtgt cttcagcttg aaggtcaggt ttcaccaggg acccgttcct acacccctat 75780

ctgcctaggc atttgtctgc ctcttgctgc tatcagttct cccctctgaa gaggtacatc 75840

taactgccat tagaataggg atgatgactg atactaactg cttcctgctg acaggggtgc 75900

ggggtgctgt tttgggaaaa tggcagtcag atctccctca gaggcctatc taagggtccc 75960

cagtaaaagg gagccatcgt ctgaggctcc agtttcatga ctggagttta atggcctgaa 76020

aatgagaaga caaccagatt attagaaggc atgtatcaaa acaaaataag ggggtaagga 76080

cagctcaaaa atcctgaggc tgccaacatg cccagataac aggtggctat agttatgcct 76140

gctaagattt gggtgaatga ggctcggctt tggtcagctt ctttggtctt attttcccaa 76200

acaaagaaac ctctgggtta cgggcaccct gtttactcct atcacctggc aggatttgca 76260

ggataattgc ccagaactag aatattgatc cagattttta catcacccat cccttttgtt 76320

tcttctgagc tgcagctgat gatcactggt tggttcacag aaataagcag ggttagtcta 76380

aaatgcagac aaaaacttaa aaacaactaa tgagactaga atttaatgaa aagtgtatga 76440

taaattttga aacataattt ttctctctcc agtcctcatt tttgttaaaa acaaatcatg 76500

ataggactga gtcatttgca gaataaactt tagtcttata tttggcctgg ttatttgcat 76560

aaagcacagc aagaataatt atttttcaca caggctttta aaattggctt tgatggaact 76620

ctgttccaca aggaatttca gataagacct tttaaagctg agcccagcca tgggtttgta 76680

tcctcaaata cctatgagtt gggtaaattc ctctcttctt gaggtcccaa gataacatgg 76740

ggttcctggg cctattagaa agtgacattc tttattcacc acagattagg aactctgtac 76800

agggactgtg tagaagacaa agtatgaggc cagttttccc aaggggcttt tattggttct 76860

gcaagtcaaa cttgatttct taaaggtaag cacacccttc cagtcaaagc cttggtaaaa 76920

caaccagttt ctccagttgt gtcctgttgc aaaagaaaat ggattcttac tgcactgatg 76980

caaataactg tattgctgca aattaagaat actcacaaat agtttccaag ttctgaggaa 77040

accaggcaaa aagaaataaa tgtgctccaa attttgttca ctggagtata ccttactcaa 77100

ttgttaaaag ctatagatag ctcaacatga aagtttcctt gactctgaaa aacaaaacaa 77160

ggatcagcaa tgttttaagc aaaaaaagat tacttcagct ttctattagt tcagtacatt 77220

ctattaactc ttcttctgct tgatattcat gaacatttca gctcttcatg agtcctgtac 77280

attttccctc tattccaatg tcacaatctc caaagttatc agaaacctgc atttgagagc 77340

acctgtcaaa gtcccatagc tgattataaa ccatcttttg aaaaggatca aaataagaca 77400

attgtctgtg aatgacaaaa tgtctttggg taataacagt caaagccatg attgacaaag 77460

aaatttggtt atttctgagc tttacaataa caacataata attttttttt tttttttttt 77520

ttttgagacg gagtctcgct ctgtcgccca ggctggagtg cagtggcggg atctcggctc 77580

actgcaagct ccgcctcccg ggttcacacc attctcctgc ctcagcctcc caagtagctg 77640

ggactacagg cgcccgccac tacgcccggc taattttttg tatttttagt agagacgggg 77700

tttcaccgtt ttagccggga tggtctcgat ctcctgacct cgtgatccgc ccgcctcggc 77760

ctcccaaagt gctgggatta caggcgtgag ccaccgcgcc cggcaacata ataattttaa 77820

ttacgattga tagcatatac tcagacatta gaattttaga aacctcatag aattttggaa 77880

catatgtatt tttcattaaa atataacctg aagaagatta aacattattt ttattttggc 77940

aatcccacat aactaaacat gtcagttaat cctgtttacc tctcttttgg atgctccagg 78000

agccctctgt agtattcaaa agtaaggggt cagaaaagac aaccttgaaa ctgaagtttg 78060

attttgggaa gcctgttaag tacattagag gtttaaaaca ctttatatta tgaaatacaa 78120

ttccagatta ccataagtca tttatttagc caaaatgatg actcaaaaat ttttaaaaag 78180

gtaaaaacct ttactcatta agagtgaaga cagctttcca aacaaacaat ccatctctgg 78240

tctctcccac accctttatt ttttgatgaa atctttagat aattctgtcc aatcttaacc 78300

agtttgacca tgaggtgaga ttcttataaa cctttacaaa tttttgttaa agagtagatc 78360

agtgccttaa gaaaaccttg ttcttttatt ctaatgttca atttacagaa aaaccatgta 78420

ataccctttt gaatttagtc aatatgttca cacacagaat ttcttttgca agattaattt 78480

ttacaaacct tccacaacct tcaactttat cttatccaac ttaaaacaat tatttaatcc 78540

tctaaactag gcaaaaattt aaattcccat gccttcttat aatcctttac taaaaacaca 78600

tttactttcc ttacacacct tgatgtaaat ctgttttcag tagtctcaat tacatggtat 78660

aatggtaaac cttagcaatt tttaatttta atgtaaagcc tggtaagtta ttttaattat 78720

gtgctaccaa ttatacctta agttgtagcg actctggtgt gctattggta atatggcctt 78780

acaaaactga aaagcaagca aggtgaacaa ttttcaaaag cccaagaagc aggctgggtg 78840

tggtggctca cacctgtaat cccagcactt tgggagaccc agccaggtgg atcacctgag 78900

gtcaggagtt caagaccagc ctggctaaca tggtgaaaat ccgtctctac taaaaaaaaa 78960

ataaaaaaat tagttggtca ttatggtgta tgcctgtagt cccagctact caggagggtg 79020

aggcaggaga atcacttgaa cctgggaggc agaggcaacc aagatcatgc cactgcactc 79080

catcctggac aacagagtga tactgttgaa aaaacaaaaa agagaaaaaa gccaaagaag 79140

cagtttataa ctttaaagca tttagtaaac ctagtatctg acctgcataa tttagaccac 79200

atgtttacat tttgaagaca tttgtatttt accaataatc tctaaaactt tttatttttc 79260

aaatattaaa atcatttgaa ctaaaaggta ttatagcttt tatttttcct tcagaaaata 79320

tttgatctaa gtgcttattt ttctctaagc caattaatta gagctctttt ttatacaaac 79380

atcacacata ttgcacatat ataactacac agacagagga tccagtagtt gtaagatttt 79440

tcattgtcca atctcctaat tagattactg acctcaggat ggagcccttc aagagcaggg 79500

ctaggaaagc atgcagtttc tagggcctaa taaatagtta tagctggaag acaaaaacag 79560

attttgagag ggatttatct gcttttaatt ctttgggttt catgaggaaa acagaggttt 79620

ttttctaaaa tggggtcagt ggtgcctctt ccattttttc cagggagtcc caggccatca 79680

gaagttatct tagggcctct catgcgtgca ttaagagagg caagacaaaa tggagaaaag 79740

taattcagtt gactgaaaaa gaaaatcttt ttccagtgaa acaagatgca agaagaggaa 79800

aacatagagg ccttttaaat atgcctatag cttggatatc cacttttaat taagctgact 79860

ttttaccata gtgctcttat tttaaaaaat ccttttaaat cccttgttac ccaactttag 79920

cctcaccaag tggccaatat ttctggcttt tgaactttac caaaagtaac ctcacaggtg 79980

aaaccaacaa gccttaagta aggttgtgac ttcactgcct gtgtacaagg tattttcaaa 80040

gagatggtaa gcagttttta caaaatctag aatctttaaa gatagctcag agaaaggaag 80100

atttaagaaa ggaagctaga agttgttcat ggaggggaag agaatcagca aatgataaaa 80160

gtcacacaga tattagccag aaagtactca ttccctaagc caggattgaa cctgggcttc 80220

cattgtaaaa tggcagagac caaaagaaag tccttccacg tggttacaag gtcaagctcc 80280

caaggacata aaacaagatg gagaccttat tcagtttttt ttccttcaga gacctgtagc 80340

aaagtctgta actgaccagt ttgcatggct ggcttgaaca gtgggcttat ggggtcctag 80400

gtctgtgttc tatcctgtct tactccttat gacagaactg tacagaaaga cacacaaagc 80460

atagcagatt ggctacagct taagattagc ctcacaaatc cttttttcca ttaatcaaaa 80520

ctttacagaa gaataaacag tgatttttat ccttcctttt actggtttgc acagggagaa 80580

agaggccaga agtctgactg gtaagaactt ttactctttt actggcatgt caggcttctg 80640

ggttcccttc ccagttcaat tttaagccaa gcagtttaag gtttggggat actaaacttt 80700

tcacagatat tttccagtat gtttaaatag ttttgtttag cccagattta ataatagtta 80760

tctgtagtag ggtttgcatg accactttat tccaatgcca caaacagaag ttgattagca 80820

aattatatat atatttgcat acgtttattt tacataattt aaggctatta gaccaatatt 80880

ggttattaca gagcatagat accttgaaag aggccaagat ggagtggaaa tgcaggttgg 80940

agctaccaaa gggcaaaaaa ctattcatca acctttaact acaataaatc ttccctttgg 81000

gtattttcat ttgtaaacac tttgcccttg cagtcattta catcggcata attacaacac 81060

ctttgcttta tttggcatgc aggagaatct tcttaaatat taaacctaac cactttgagt 81120

gatttgcatc ctgcttttgt catacattta gagccttggg attttttctc agaatggagt 81180

aggaaacaaa tagggtctgg atagggaaat tgaaaagctt cctggtattt ttctgttgaa 81240

agatttctta acatggcttc ttggatgtgt ctctctgatg tcaaacatac acacatattc 81300

aaataagagt tatacaagca catcttgcac atttttggca tctatgtctc aagacacagg 81360

acattctatc tggtgttctg atcgaaccac ttttgcatgt tactagactg aaaattattg 81420

gaaggtagag aaatctctta tttgttttta tattcccaac agcctaggat agagcctaga 81480

acattaaaga acactaaaat tttaatagtg taactgaaaa gcaggttagt tggtcactgc 81540

atgtagagtc caattaacaa gagcaagttc tgatacaaag aagtgtattt atttcaaaac 81600

tagcttaggg gaagaggcac aaagcatcct gcctttaaat gtgccacttc acctttggag 81660

caaaaagtgg gcatttttat aaggtagggg aggaaatgag caagggcaag tgtccctctg 81720

ctactgggca agtatctgag ctggcacctt cttgggcaga agtaagttgt aaaagtggcc 81780

aagtgggtat gctttcaaca tgccctccta gtgggcatga gttctgagat gaccctgtgg 81840

agagttctgt gggggcatgc tttggtctgc aaatagactg ttaactttcg aggagagatc 81900

cttgggggga aaatatatat taggaagtcc tctgtgggtg ttttgtagaa ggacctagag 81960

ggactagggc tcgattgtta tttatttatt tatttattta ttgtgtgtgt gtgtatgtga 82020

gagagagaga gagaaagaga gagagacgag gtcttgctcc gttgcccagg cgggagtgca 82080

gtggcatgat catagcttac tgcagcctca aactcctggg ctccagggag cctcctgcct 82140

cagcctgcca agaagctggg ggtactgttg tgtgctacca tgcccagcta gtttttaaag 82200

tatttttttt tgtacagatg gggtcttgct ttgttgccca ggctgggctt gaactactgg 82260

cttcaagtaa tcctcctacc tcagcctccc aaagtgctgg gattatacat atgagccact 82320

gttcctggtc tagtttgcac tttttttttt tttgaaacag agtcttactc tcttgcccag 82380

gctggaatgc agtggcacag tctcagctca ctggaaactc cacctcccag attcaagcga 82440

ttctcattac tcagcctccc gaatagctgg gattacaggc acccgccacc acacccagct 82500

aatttttgta tttttagtag agatggggtt tcgccctgtt ggccaggctg gtctcgaact 82560

cctgacctca ggtgatccac ccacctcggg ctcccaaggt gctgggatta caggtgtgag 82620

ccaacacgcc tggccgtttg catttttaag ataaaaattt taccatgctg gatatattgt 82680

agtagctatg tacttcagtt tctcaattgt taaacgaact taatagaagt actaccttat 82740

agaattttgg ggattaaatg aaatagtctt ctgagcacaa caaatatatt atcagcacaa 82800

cagctagctc tattgagtct ttattattat aatagcagta atagtcagac ttggaagggg 82860

tgaaagagaa caacagtcca ttttattttt gtggcatata tatcataggt cgtaagacct 82920

tggattgttt agatgccatg ttaaaacttg acaaaactag aaatgttgtg agtgtgcaat 82980

agcaggtgat aactaatcca atcattaatt tattcttgaa tttgatcaga agacagacct 83040

aacttcatct attgccaatt actattgtaa caaaatctat tggaatttca gtttaggcac 83100

tgcagtacaa cagtgtgaat ttcaaaagtg agatatttta tgtggctttt taaagtaggt 83160

tttcaaacca gttaaaggct ctaaaaccca ttaagaagga tactatgggt caggaatagc 83220

attttatgag gactcttgag aaaacggtgc tgtgtcccct ccaccatggt acaagtggag 83280

gttactctca cttgttctct tacttggtat ctctccttgt tctctcactt ggtatcgagt 83340

gagaggggct tgagagggac gacttaagag atttaaagga ttccccccag ttgtggggaa 83400

cagaggacca gggtctgact ctccgagaca cccaacaagg agagaactgt ggctggctta 83460

ctgttcccac agaaagagct atactgtaat ggtgtctgtg ttgaatttgc ctacactcca 83520

gcctttgtca gggcaaagga aatgtatttc ctgcagatta gaggtgggcc tagcggagga 83580

gaagggtagc ctggagtttg actcctgtcc aaatagatgc ctgaaagggc agtgaggttc 83640

caacggtgag tcttttgaat gacagccaaa aagccaggag aagaatgaat tatccaagtc 83700

aatggtgcta tggccagaag gaactctgag gtgagactct gaataaccca tgaaagtgca 83760

tctgagaaaa aagaattagc ttcaaacatc agcagacagg tgttttggca ggagggctca 83820

tgttggaatc tccttgcctt ccttccccag cctccatctc ccatgcctac cttggaaaga 83880

tccgacagct agggttggtg taaagtcgtg gaagagaaga aaagcagctg atttcaatcc 83940

cttcccaggt tccctggggc tggggaaaag atttgattac ctagtgaata ttggtttgtt 84000

acctaaagtg accataagcc tttctttaac attgaccaaa agaatcaaat gggcctgttg 84060

aagtgttcat ctagtgtcaa gggaaaattt ttccccactg aataaatttt aagaaggcag 84120

tcaagacaag aagctatatt tgattatatc ctgttagtgc ttattcaata gacacataaa 84180

tctgtaattt ttaatatttg gtatagaagt aggttgaaat ccacagtaat tcacagaaac 84240

ttgtgcaagg gttttgtttt cttttctttt ctttcttttt tttttttttt tttgagacag 84300

aatctcactc tgtcccccag gttggagtac agtaggatga cctcggctca ctgcagcctc 84360

cacctgccaa ggttcaagca attcttgtgc ctcagcctcc tgagtagctg ggattacagg 84420

catgagccaa cacgggcggc taatttttgt atttttagta gagacgaggt gtctccatgt 84480

tggccaggct ggtcttgatc ctgacctcag gtgatctgcc tgccttgatc tcccaaagtg 84540

ctgggattac aggtgtgagc caccatgccc ggccaaggtc ttttttcttg aaaatatctt 84600

cactcatata agcagtatat gcaatataag gatatgctct tgggtttctt gatgtggtct 84660

aatatttagt gttggcccct taattataaa agttgctttt atctaaaagt agatgttagt 84720

tgtcaggcaa tgtttgctgt aaaaaataaa taaaaaataa aagtagatgt taggatattt 84780

tcttctagcc tgcctagtat ttatattaga ttctttcttt ttttgagaaa gcgtctcgct 84840

cagctgccca ggctggagtg cgcagtggcg cgatctcggc tcactgcaac caccatctcc 84900

cgggttcaag tgattctccc atctcagcct cctgagtagc tgggattaca ggcacccacc 84960

accacgcctg gctaattttt gtattttagt agaggcggag tttcaccatg ttggccaggc 85020

tggtctggaa ctcctgacct caggtgatca gcccacctcg gcctcccaaa gtgctaggat 85080

tacagatgtg agccaccgca cccagcctag attgtttctt aaaccataga tgtctgaact 85140

tttttgaatg aaattaaatg attagagatt agtaaaattt ttgtataaga tagtagacta 85200

acaaattctt actagtctgg gtgtggtgac tcatgcctgt aatgccagca atttgggagg 85260

ccagggtggg ccgaccactt aagcccagga gtttgagacc agcctgggca acatggcgaa 85320

accttgtctc tacaaaagat acaaaaatta gctggtgtgg tggcacacac ctgtagtccc 85380

agctactcag gaggcttagg tgggaggatg gcttgagcct agaaggcaga ggttgcagtg 85440

agcaaacatt gcgtcaccgc actccagcct gggtgacaca gcgagaccct gtctcattaa 85500

aaaaagaaaa tttaacctgt ctcaagctct ccatactgta aggctctgca tgtcttgatt 85560

ggattgtgct aatatatttg gccaatcagc tccttcctac tgtctacttt tgaatccctg 85620

tcaccaccat ctaagtcaag atgacagtgt ttacacagtc tcttcattgt gttttaagat 85680

tatagtcttc tttctggtga gggaagaaag aaaatagaaa tatggcttac tgattgggcc 85740

atggcttacg cctgcaatcc cagcatttta gaggccaagg tgggaggatt gctggaagcc 85800

aggagttcaa gaccagcttg agtagcaaag tgagaccctg tctgtacaaa agaaacacac 85860

acaaaagaaa tatgactgac taaaatacat ataattttca taatacttta aaatgtaaga 85920

aggcaaaaaa tttctgggct caaggtgggt gatcgcttga acctaggagt tcaagaccag 85980

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cctgggcaac 
ggtgcacacc 
ggctgtcgag 
tgagaccttt 
aaagtttcgt 
gcttaacttc 
gagcttcagg 
aattattaag 
taattctaga 
aagtaacctg 
ttttccatag 
acagacttga 
atttgtattt 
ttctgattct 
atatctatgg 
caaatattct 
tactcaaaaa 
caaatataac 
aggttctctc 
cctatttggt 
gtccagttct 
gataccttaa 
attttttaaa 
cagagcttga 
aggatatatg 
acataaaaat 
gtttagggtc 
gttcataggc 
aacccaggta 
atcatggcaa 
ctcctgtatc 
tggcctgtta 
tcctctatat 
cagatcagtg 
tgagggatct 
ctcccgtcac 
gattctacat 
agaaataaag 
caccatccat 
cctgtattag 
gcacataact 
gttggtagaa 
gaggtgatta 
tttatcttca 
aagcgttaat 
aaatatttaa 
catcccttcc 
gaaattctat 
atctttaaag 
tatttggaat 
atatttacat 
atctggaagc 
ttacaactct 
cctagtttgt 
agcaataaaa 
ctctaagtga 
atattctatt 
agatacagga 
tttcctctag 
acacccatgt 



ctggcaaaac 
tgtggttcta 
atcacagtga 
tctcaaaaaa 
ttggacttaa 
ctttgctaag 
gtcatctacc 
tgcttatgtg 
aaggtatgtg 
accaagaata 
cccattttca 
tttcctcagc 
aatatttctt 
caattttttt 
ttttataaat. 
actagtgtat 
aactaaaaat 
ttaccatgtc 
tccatgggga 
tgtcggaaaa 
tttgtaaggt 
agttttactt 
actttaagca 
gaatttgaat 
tgtttcttgg 
cctaatggca 
agttcagtct 
ttgatatgaa 
gagaattcct 
aatgaagtgg 
tattagaata 
ggaattgggc 
ttatagctgc 
gcagcattag 
aggttgcact 
acccagatgt 
tatgatgagt 
tgcacaataa 
ggaaaaattc 
aagaaaagac 
gcatggaaca 
taatcctcag 
agaagaaaac 
tttcctgcct 
gtggtcatta 
tcaacagaat 
tcaactctaa 
ttggattttt 
tacctttatg 
aagattcttg 
tattattaat 
ttccaggtac 
agttgtggta 
taacaaaaca 
attatgacat 
aacaaaattt 
aaatttcgtt 
gcaaggggaa 
ctccagttct 
cccttctctg 



cttgtttcta 
gctacttgga 
get gaga ttg 
aaaaaaaaaa 
tatgaaactc 
aaaatagagt 
aagacagaca 
ccagatactg 
tttgatccat 
cgttgtagtt 
cttaggttcc 
gaagtcagat 
tcctgttggt 
attgtgaaat 
gattataaaa 
gagtcaattt 
aaaatttttc 
ataattcccc 
tgattttctt 
cagctgtcgt 
aaggacttgg 
tcagaaacta 
atatgactta 
gctttttttt 
tgcatggaaa 
ctcaagttat 
gaggcccacg 
atatttttgt 
gtctgttctt 
tccaatgtac 
aggatcccca 
tgcacagcag 
tccccattac 
attcttctag 
ccccttatga 
gactgtctag 
tgtataatta 
atgtaatgca 
ttcctgaaac 
acagtgeact 
tctctccctg 
ttagacaaac 
ctcttaagat 
ctttcctgtc 
aggaaaaata 
ccttggtttt 
agtgagcctc 
aaaaattata 
ggttttttct 
taggtaattc 
gtaattaatg 
ctattgtcag 
gagaggggac 
cacatacaaa 
atattttgat 
attgaaacca 
atttagcttg 
atgtggtata 
gecttctgag 
aatccttttt 



caaaaagtac 
agattgaggt 
caccactgca 
aaaggcaaaa 
atttctagaa 
agtatactag 
atttgttgtc 
aagtttatat 
tttcaagata 
gtagagctgg 
attagtttat 
tactcatttc 
tttcattttt 
ataacatacc 
taaataccca 
ccatggcaaa 
taaatgtgta 
ccacgctttc 
ttgctgccca 

gggtgctgct 

ttttttcatg 
tgetatttae 
taaaacaaac 
cttgtaatgc 
aatgtttctc 
aacaatgatt 
ggttacaata 
taatgagacc 
taaaaaaaga 
cttaaaataa 
acccctgtgc 
gaggtgagct 
ttgeattace 
gagtgeaaac 
gaatctaaat 
tttcaggaaa 
tttcattata 
cttgaatcat 
tggtctctgg 
gaagagggac 
cagctctagt 
actgatttaa 
gatttccatc 
cctagctgtc 
agtcaaattt 
actcattgcc 
attctttcat 
tttatttctt 
tctaaaattg 
catgagatga 
ttattaatac 
tacttgtggc 
tgagaggtag 
gcaatgtttt 
gtgacccagt 
aaattccctt 
ctgatcaaag 
gattctgagt 
gaaaaccttc 
gcacatatga 



aaaaattagc 
gggaaatttg 
ctccagcctg 
aattaaatta 
atgatgatca 
gagacttcca 
atcatcagtg 
acactttctc 
agaaaactca 
gaatatgact 
tatatatttt 
tttggcacag 
gagtttttat 
ttcagaaaaa 
gtaactatta 
agaactaagc 
ttatatcaat 
ccttctttta 
atagtcagcg 
gttggatttg 
ttgcttttta
aagcaaagat 
tgttatccat 
ctaagactta 
cttatctttt 
tatctaaaac 
agtggtgttt 
aaaactttgc 
acatgctaaa
aacttaatgt 
cacagactgg 
gtgggtgagt 
acctgagctc 
cctattgtga 
gcctgatgat
acaagcccag 
tattacaatg 
ccctaaccgc 
tgccaaaaag 
tcactgtaag 
tctccttgta 
tatgtagctc 
ctttgtctct 
tttatgctgc 
acatttgact 
gccccccacc 
attttcttcc 
tgtagaaaat 
ttttttaaag 
tttattttag 
ctcatcagat 
tctaccactt 
acaacttatg 
tcaaattttt 
tctgtctctc 
actacatgta 
gctactgaaa 
gtcaagtggc 
tccaacaact 
ttggtatccg 



caggcatggt 
cttgagcctg
ggcagcggag
ttagtatggt 
tttgcatagg
gagctgcata
ttaaactcta 
taatctttaa 
ttcagagggg 
catgtctgtt 
agaaggactt 
aattatgatt 
ggaatctgac 
aagattaaac 
tccaggtcag 
ttaggcacta 
ggaataaata 
cagcatgggt 
tcttcacaga 
ttgggctcat 
aaaactgtta 
cctccttttc 
agcagcaaat 
gacatcaatc 
tcttctattc 
agcagcctta
taaagtagct 
catttattcc 
attttaaaat 
caatgtactt 
tacgggtcca 
aagcaaagct 
cacctcctgt 
actgtacata 
ctgtcacttt 
gattcccact 
taataataat 
cctcccccgg 
attggggatt 
gaacggatgg 
agtcccttgc 
tggctaacag 
actttagtgg 
tttagttgaa 
ttttattttt 
ccccaacaca 
atctaattta 
agatattttc 
aaaaaagttt 
cagcaacata 
agcttctttg 
gccgaatgta 
taatctacta 
ctgaccactg 
tttctctacc 
atattctcat 
cttgagagca 
aggtccattt 
taggtcaatc 
acagccttac 



86040 
86100 
86160 
86220 
86340
86400 
86460 
86580
86640 
86700 
86760 
87000
87180
87300
87360 
87420 
87480 
87540 
87600 
87660 
87720 
87780 
87840 
87900 
87960 
88020 
88080 
88140 
88200 
88260 
88320 
88380 
88440 
88500 
88560 
88620 
88680 
88740 
88800 
88860 
88920 
88980 
89100
89160 
89220 
89280 
89340 
89400 
89460 
89520 
89580 
tcatttacat
agggaaaacc 
atacttcact 
agtgatacca 
atatacactt 
attgttaaat 
aggtggaaaa 
attatttggg 
cacatttttt 
ttatcatagc 
ctgagtagct 
tttttttaaa 
gccttggcca 
agatttttaa 
ctaaacaaag 
aagcctagaa 
ggagaagggc 
ttccttatga 
taaccaactt 
aaaaaacttt 
tcctttatct 
ttattctttt 
acttaattta 
gaacagtgca 
aactgcacct 
cttttctttt 
tcatagttaa 
aattgaatca 
agcacagttc 
acaaaatcac 
tttagtgcta 
ttttggttaa 
agattttatt 
ctttatacag 
gctcctttgc 
gactggtgaa 
tcttaagggt 
tcatgtttgt 
ataaagagag 
aaaaaatttt 
taccacttgc 
actgatcatg 
agttaaagtt 
tctctcaaga 
gtgtgctttg 
ggtcccagct 
tagacaacat 
tgttttttaa 
taggtctctt 
accataagta 
ttgatacaaa 
tcagaggaat 
tataggaaat 
gcaaaattca 
cctcacgcct 
gcaacataag 
agtgtctaaa 
ttttttcttt 
ggcacacata 
gagatttgtg 



tgcacttatt 
tggaagtgaa 
ctgcaggagg 
ctgtacagga 
tgaagaaggc 
aaaaacaaat 
actgatacag 
atagatgtct 
ttttttttga 
tgactgcagc 
aggactacag 
tagagatggg 
cccaaagtgt 
tattcttttc 
agtggagatt 
gtgattcttg 
atctcagaga 
ttacctcagc 
taatcttact 
aactagtttt 
tcctaagaaa 
gacacatgca 
agtctttctg 
gtacagggcg 
ctgtgcaaat 
actaccttgt 
ttctaacttt 
ctgttgattt 
tcactcagtg 
tgatgacctt 
catgcttgac 
cttttacttg 
ctaccaagca 
ttccatcgag 
ataccagcct 
tggcattgtc 
tctgattgac 
tctctttctg 
tggaggtagc 
caattttgcc 
actttttgga 
aatattacta 
ggcctcaaac 
caatgtcagc 
gtattttaaa 
actaagctga 
agagagactc 
caggaaggct 
tttcttcata 
aattgttttt 
gaaacattac 
atctgagaaa 
taggatttca 
gctttgttat 
gtaatcccac 
gagacttcat 
gtatctgccc 
tgtggctgaa 
tttgtatgtt 
caatttttct 



tggctgccaa 
aagggttcag 
gagaagctct 
actgatgttc 
agagagaaat 
aagcacttta 
aatgaatgaa 
tgtttcttta 
gacagggtct 
ctggattcct 
gcatgtgcca 
gttttgctat 
tgggattaca 
cacaatgtat 
cagtaaaggg 
agcctgattt 
gaggattgcc 
atgttgctag 
ttctaacaaa 
attccttacc 
tgtgcaatta 
tggccttaca 
tagtcagaat 
aaggttggtc 
gttcctgaca 
ttggtaagta 
tttaaaaaat 
tacccttcct 
ctgtgtgcgt 
aatggttatt 
ttactgagtt 
tagaaaatat 
cacagtaaca 
cctctgtacc 
tcattggtca 
actgctggca 
agcgtgggcc 
gctggcttta 
ggatcctccc 
atcttcaagg 
tactttgtgc 
tttaataaag 
attatcctgg 
acatccatta 
aaccctgcaa 
ggcaggagga 
tacctctaaa 
aaaatagtgg 
tggcttttat 
caaccgaagg 
atctacttgt 
agttactcaa 
agtgctcaag 
actgaaatac 
cactttgggg 
ccctacctgg 
ctggcaacgt 
aatgtattta 
tgtgaacttg 
ttagttaaga 



acgtcacaaa 
cagtagtgca 
gtggttttcc 
ctgatgattc 
gtataataga 
agtaagttac 
gtattaattt 
agcagactat 
cactattact 
gggttcaagc 
tcacacccag 
gttgccagac 
ggtgtgagcc 
ctggtacaca 
tgatcagagt 
gaaggtggtg 
aacatgcctt 
actagcatta 
ttgtttgctt 
ataattgttt 
tttggttaaa 
gcttacaaac 
atgatttctt 
tacagccctt 
taaccttggg 
cctggaaaaa 
gtttcattga 
ggcaccaacc 
cacctgaaat 
tttcacatat 
tttccctcta 
gttgatgaac 
atattgaaag 
ttacctatgg 
ttttgggaca 
gcagtgtctt 
tcttttacac 
cttaccgacc 
tcttttccag 
tgacagctta 
cttatgttca 
aaaaacttct 
ttgtaatttt 
gaccactaaa 
acccagccag 
tcacttgagc 
aataaaataa 
aacaaattac 
cagggacaaa 
gtgtaggtaa 
ggtacacttc 
gatctaaatg 
aagtttatat 
gaataattaa 
ggctgatgca 
ggaaggaaaa 
ttgttcaaaa 
caattcaccg 
gatttttttc 
aataagtata 



ctggaaccat 
aataccatca 
aactgagagc 
tgctgtgaac 
cttaaatttt 
aattatctgg 
ctgttttgtt 
gaatatcttg 
caggccagaa 
cgtcctcctg 
ctaatttcag 
tggtctcaag 
accacgtctg 
gtagttgctt 
gaggtgagat 
ctagctgtgg 
aattttatca 
ttatccaaaa 
ttactactga 
caaagaacat 
ctgtaagatt 
tgggatcact 
gttgtcttgc 
aggccagcaa 
gaaaaaatat 
ctccatgaaa 
gactaggttt 
tttatttctg 
ccacagaaag 
tcagattaaa 
ttttggttaa 
aaaaacccac 
ctgctttcca 
aatcatattt 
ctatttcaag 
cacaatcctg 
attgagggtg 
tcttgctacc 
gaaaaagttc 
tgcagtgtgg 
cttggtgagt 
ttgaagagaa 
ggtattcttg 
cagagagagt 
acaccatggt 
ccaggagttc 
atgtccccaa 
aatcagtata 
gctagcgcta 
ttagcaaaaa 
tgggaaaatg 
aggaaagaga 
tgtttatttc 
tgtctagact 
ggagttcaag 
aaaaaaaaga 
gtgttcatta 
taaatgataa 
tagcttacag 
aatataaccg 



gtgttactga 
taaagctcat 
attacagtac 
agtattttta 
tttctttaaa 
aaaactactt 
ctgtgttatt 
aaggcagaac 
tgcagtggtg 
ccccagcttc 
ctattttttt 
ccatcctcct 
gccaaggacc 
aatatgttgg 
taatttggga 
attaatagag 
gattctagag 
ttttaattat 
tagccttttc 
aatgatatga 
atttaatcca 
aaaggaatac 
acaatactga 
aaacaggcac 
aaaatgcggc 
taattagatt 
ttggtttgtt 
agctgtggag 
aggtggctga 
ttaaaatacg 
tttttttttt 
ttatactata 
tctttttcat 
gcctgcggct 
aagcgccttg 
ctgcctttgc 
ctctgcatct 
agtaccaaag 
agtcctccaa 
gcagttggaa 
atgctccttc 
agttaggtcg 
aaatgaaagg 
atgtttcata 
gcctgtctat 
gaatccagcc 
acaaacacaa 
aaacatttga 
tgattttgct 
agccatgatg 
ggaattctat 
actatggttt 
tatttcaaag 
ggggttggtg 
accagcgtgg 
aggaagaagc 
tgtttcttcc 
aaatggcatt 
tctacttttg 
atttacggac 



89640 
89700 
89760 
89820 
89880 
89940 
90000 
90060 
90120 
90180 
90240 
90300 
90360 
90420 
90480 
90540 
90600 
90660 
90720 
90780 
90840 
90900 
90960 
91020 
91080 
91140 
91200 
91260 
91320 
91380 
91440 
91500 
91560 
91620 
91680 
91740 
91800 
91860 
91920 
91980 
92040 
92100 
92160 
92220 
92280 
92340 
92400 
92460 
92520 
92580 
92640 
92700 
92760 
92820 
92880 
92940 
93000 
93060 
93120 
93180 
tatcaggcta
ccaacttaag 
caacctcttt 
tgagtagaaa 
aggctggctt 
cagctttaaa 
actcagcttt 
atttttggtt 
ttcacaataa 
aacagatcta 
agcagagcag 
tcttaatagc 
gggaaaaatg 
atttcatagt 
aaatcagatg 
ttttctgctt 
tgagattttc 
aaagatcaga 
gtttgtcact 
ctagtttcct 
gatgatgttc 
gtaaaaattt 
tataaagcac 
tgtatcctct 
cagccaggca 
atcacttgag 
aaaaagtaat 
tgggaggctg 
atggtgcgac 
taaagtattc 
agtggcagcc 
tctgtagtcc 
aggctgtggt 
tgtctctaaa 
actgataatt 
gtgaaggttc 
aaggcaggtg 
cctgtctcta 
gctactcagg 
gctgagaccg 
aaaaaaaaaa 
tactagttgc 
aacccaattt 
tctggcataa 
aagtaaattt 
atagcattgg 
ctagggaact 
cattgctatg 
tatgtatgct 
aaaagaagaa 
ttaaaatctt 
ctaaaaacga 
agttgttttg 
gttttcgttt 
caatcccggc 
ccctagtagc 
agtagagaca 
tccacccatc 
tctgggtttc 
tcatttagct 



catcctgatc 
gtaagtttcc 
catagcttct 
ttctttttct 
caagcaaccc 
aaaaaaattc 
gctaatacta 
atatatttat 
accagtaaag 
ttgttaatag 
gagttagttc 
atattgggcc 
agtatgccta 
ggaaggtgag 
gtgcacagaa 
tcccttcttc 
tctttctcat 
tagctaaatg 
ttgttgtaat 
ccttatacag 
aattcctact 
gctctgaagg 
ttttacatac 
ttatttcatg 
cagtggctca 
cccagcagtt 
ttaaaaaaat 
gggtggaagg 
tgcactccag 
taaaggaaga 
ttttaaatat 
taactactca 
gtgctatgat 
aaacagataa 
attactgtag 
tgggcctggt 
tatcaccaga 
ctaaaagtac 
aggctgaggc 
cgccattgca 
aaaagttctg 
ataaccttgg 
tgcagagttg 
aaatgggagt 
tgaaaggaga 
agttgggcct 
aacttaccag 
ttaaaaattc 
tgctttcaaa 
ttagattgcc 
ttttagaagg 
ttcctagtag 
ttttgttttc 
ttttttgaga 
tcactgcaac 
tgggattaca 
gggttttgcc 
ttggcctccc 
tttttttttt 
ttcaggagca 



tgatagtcca 
tggaatatag 
gtagtcaaga 
tttttttctt 
tcctgcctca 
tctttggtac 
gacatttacc 
aatttgtaaa 
cagataaaat 
agacgttaag 
tcttccactg 
acttccctgg 
aggaaattta 
ttagacaatg 
caataaagtc 
caggcatgtt 
aacggtgccc 
ccatacaggt 
ttaaggtttg 
ggaaaaggtg 
aaacccttta 
agcataaacc 
attctcttat 
gaagggaaga 
cgcctataat 
caagaccagc 
tatctgggca 
atcgcatgag 
cctgggtgac 
acagattgaa 
ggaaggtgaa 
ggaggctgag 
tgtgccactg 
taaataaaga 
tacttttatg 
gcggtggctc 
ggtcaggaat 
aaaaattagc 
aggagaatcg 
ctccagcctg 
aagtaagaca 
gcatcctctg 
tgaggattag 
tattttaatg 
aagataatac 
tgagcatgag 
tttaggtctt 
taagatctta 
gtgaggttgg 
tcctcgaagc 
atacaaatct 
cagctgcatc 
tggaatcttt 
cggagtcttg 
ctctacttcc 
ggcgcctgcc 
acattggcca 
aaaatgctgg 
tttttttttt 
taaactctct 



ttttcatact 
atctgttgtg 
gaacatttat 
ttttccagac 
gctaggatta 
taccacatga 
aaaggaaaag 
ttaatttcac 
aaatattagc 
tgattttcca 
tgcttacctg 
atatattacc 
ataagcatgt 
ttatagatac 
tctgttaatt 
ttcttacaaa 
gttatcgcac 
gtttaaatct 
aattatggat 
tctcatatcc 
ttgactgttg 
tagtacatgt 
ttaatattca 
ctaaggcccg 
ctcaccactt 
ttgagcaaca 
tggtggtgca 
cccaggaggt 
tgagtaagac 
caatttttaa 
gaaataaaga 
gcaggaggat 
cacgccagcc 
agtaacttgc 
taagaggcag 
acccctgtaa 
ttgtgaccag 
tggacgtgat 
cttgaacttg 
ggtgacaaga 
gatctggatt 
taagcatcag 
ataaaatgta 
taaggcaatg 
ccatttggaa 
attttgtgta 
tgaagattca 
ttagattgta 
aggtttgatc 
taaatttacc 
tatagatcaa 
agtttttatg 
ttttgttttg 
ctctgtctcc 
cggattcaag 
accacacctg 
ggctggtccc 
gattacgggt 
ttttcttttt 
tggttttctg 



attaggaaag 
acaggattaa 
tgtgtccttt 
agggtcttgt 
caggtgcaag 
acacacctag 
tggtagatga 
atatattact 
tccaatttta 
agaattacat 
gtagcaaaat 
aaatgtgtcc 
tatttcttca 
ttttgtgatc 
ctgttaataa 
atatgttgac 
cgaatgcagc 
cctctttggt 
acttaaccag 
ttcaaaagac 
aggggacaca 
aattaaaaat 
caacaatgca 
gaaagattaa 
tgggagaatg 
tagtgagatt 
tgcctgtggt 
caaggctgca 
cctatctcta 
tttatttgtc 
gccagatgtg 
tgctggagcc 
tgggtaacag 
ttgaggtcac 
tattgtatag 
tcccagcact 
cctggccaac 
tgcttgcacc 
ggaggcagag 
gcgaaactcc 
taaattcagg 
tttcctcatc 
tgtgaaacat 
tgattgccaa 
aagtggtttt 
caaatctgat 
gaaatacaat 
catgatgatt 
ttctcgtagt 
tttcttttag 
tttagatgag 
aattgcccct 
ttttgttttg 
caggctggag 
tgattctcct 
acttaatttt 
gaactcctga 
gtgagccacc 
aacggctcct 
cctacctcca 



tatagccgaa 
ctttaccatc 
cttaaaaaga 
taagttgctc 
ccaccacacc 
agaaatcata 
ctgtctagtt 
tcatttgact 
cagactgaaa 
gtcagtaaac 
cagtctacag 
atcttattag 
ggtaaataaa 
aggagatggc 
accatgcctt 
attgttcatt 
acggtagagg 
tatgtactga 
gaatgggaca 
tagtaaagta 
tatatgagac 
ggctacagtt 
gtacctgtgg 
ataacttgct 
aagtggaagg 
ccatctctac 
cccagctact 
gtgagccatg 
aaaaaaatta 
tcctcctcct 
gtggtacaca 
caggagttca 
agcaagactc 
agagatagtg 
tggtttaaaa 
ttgggaggcc 
atggtgaaac 
tgtaatctca 
gttgcagtga 
atctcaaaaa 
ttttgtttct 
tatggagata 
ctacctcagt 
cttgagatag 
aaaaagtttc 
ctttgatcaa 
ggagtgctct 
tgagagagaa 
tgacgtttca 
gccttcccac 
gcctaacttt 
tttgcctgag 
ctttgttttt 
tgcagtggtg 
gcctcaacct 
ttgtattttt 
cctcaggtga 
acgcctggcc 
ctgactcctc 
catcactcct 
ccttagtttc tttgctcact tcttcttttt cccactgacc cctgaatatc agcatgtcct 96840

agggcttgtc ccctgatctt tttctccatg tattctactg gtggtttcat ccagtctcct 96900 
aagttcatac atcacgtata tgtcaatgac ttcaaattta taattctggt ccagaccttt 96960

tccctgaatc ctccaccaga gctgtatatc cagctgctta cttaacatct ccacttgggt 97020

aactgctagg tgtttcagac ttaccctgtc taaccctgag gtcttgatct taccccttaa 97080 

aacttactct gcccccagcc atcctcatct caggagctgg caattccgcc ctttcagttg 97140 

atcagactca aaactttgga gtcatccttg gctcttcttt cttgcacacc atagtcctga 97200 

tccagtgaag aaatcctggt ggcttttctt tcaaaatata tccaggatct gaccacctct 97260 

caccatcctc actactcata ccttagccca ggctaccacg tacccctagc ctggatcact 97320 

gccagagcct cctaactggt ctctctgttc cttctctgcc ccgcggagtt tgttctctat 97380

gaagaagcca caggcattct ttctaaacat aagtcactct gctcagaatc cttcaatggc 97440 

ttcccatttc cctaagagta aaaaccaata tccttacagt gacctacaag gtccttcaca 97500 

atctggcccc cactacctct ccgagcttcc atcgctgtcc cttgcccact ctgcttctgc 97560 

cattcgcctt ttaatggggc tcactctgac tacctgcttg aaacttcctg cgtccctttt 97620 

cccctgagta ttcacaaacc gctcctagta ctccttttct ttttttgtag cacttaatac 97680 

tttctaacat tatctatttt acttctttat tgtagtcatt gcttactatc cgtatattta 97740

cacgtctgct agaatgtaaa caccacaagg gtaaggatct atttcattca gtggtagatc 97800 

ccaagcatct agcacagtgc ctagcacaca ctgggtgctc aaatatttgt tgaatgacta 97860 

aatatattct gggtgagtct gaagtgacac tgtataagta atgttcattt tttcatcatt 97920

tggatcttta aaattctcta ctttgatgct ataatgattt ttcacattct gtacttgcag 97980 

gacatggtgt tattaatatt tattcaatac ttattcaaca aataagctca aactaaggaa 98040 

acctcggaat aattgagtaa ccagtaatgc tgtccgttga tggaggagag agttggtgtg 98100 

ttttgctctg attcacttat gcctttgctg aaattttaag ataaatagaa gaaatttctg 98160 

gtccctcaag taactgtgtc ttcagtaccc actgaaaaat ctcaaagagt ctggagtggt 98220 

gtgtttaaga ataggatgca ggatgcagaa ccataaccag gcctcaggtc tgcatagctt 98280 

tggtcgagca ttgagcatag ggcctcgtga gataactgat aaatgccaaa tatgacaatg 98340 

ataaatgcca aatatgacaa tgataaatgc cgaagaatga cagtgacaat gataatgaag 98400 

ttaccaaaaa tgatggtaac ttttctcatt ggcatgaaat gctctatctc caatctgaag 98460 

ctgatgatgt agtttcagtt actctcatct ctctcccctg ctactcagat tgaaaatcag 98520 

ctacttagta cctgtgttct ttgactctag accatatcat tgggtcaaat ttcagttttt 98580 

aaattttaga tccacatggt tctctgtcaa gaagatgact gactcatatt gaaatctgta 98640 

aaatatgtat tcattagcct gttttttaaa aactccctta taagtgggtt gactttgtgg 98700 

cagatagtaa ttgactgttc tcaaaagaaa ctttgacctg gtaggaagat cccatttacc 98760 

tgatgctatg gttcaagaca gacagatcat ttgcttgcta gcagggcaat taggtgaact 98820

tcaagtccac tagtaattgg aaatgatttt tttttttttt tgagactgag tctcattctg 98880 

tcgcccaggc tggagtgcag cggcatgctc tcggctcact gcaaccttca cctcctgggt 98940 

tcaagcgatt cttctgcctc agcctcccga gtagctggga ttacaggtac ctgccaccac 99000

gcctgactaa tttttgtatt tttggtagag atgggtttca ccatgttggc caggttggtc 99060 

tcaaactcct gacctcaggt gatctgtctg cctcggcctc ccaaagtgct gggttatagg 99120 

cattaaccac cgcccctggc catgaattgt atttttaaac cagaaatgaa aatttgagac 99180 

taataagtca gtacagggag catgtaaacc tcgaaaggta ttttttagct ttgagtagtg 99240 

ccagatgctg ccaagggtca atcaacactg gaatgtagct attagacctt gctaggcaga 99300 

gcacctccat ttacactgtg gtcagagcag cagtactgct ccaagccaga gctaagggcg 99360 

ctgagccacg caaataggaa cagcatacaa gccttcatct ctctgtggct tcctcagagg 99420 

gagattcatg taacatttgc caagaattga ttatgtgtca agcacttccc caaaatctca 99480 

cagaaccacc acaaggatga gtgtaataaa taacacatac ttagagccaa ggaaacaatt 99540 

ctgcaaagct gtgcttgttc aaagccattc gcattatgct taaagctggg atttgaacac 99600 

aggtttcaga caaatatgtc tgaaatatac tctttttatg aaggagtctg cattccttca 99660 

ttgctaatcc agagatagga gtgctgctat tttcagccat actgggccta caccaaagat 99720 

tgctttgcac gtttcccttc tgttctctca gaacgaagaa cagaggccat gttgagctgt 99780 

tccagcgctc agagcatgct tcacagccag ggagaaaact ctggaggaaa ccagcttttg 99840 

ttttgatata attaatggga atgagaaaat atctataccc ttattttcag ccccaacttc 99900 

tcttttgatc tcaagtacat tgtgaatatg agaaaactga ggccatgcag ttacttttca 99960 

caacctgtga caagcagaac atggaccata catagctttg tgttcaattt tgctttctac 100020 

agtaaacatt aagcataaca gaagaacaaa aatggacatg tacaaattta tagcaagatc 100080 

tatcctttat ttgattaaca taaatactat tgcaggaaaa tggaaaaagg taaactgctt 100140 

gaaatttagt cacatataaa cgctccgagg ccactggtgg atcattagtc tcctgagaga 100200 

gctctaaaga attagtgtgt tggaaaactg ttccctcctg ttaatgtgta aatttcacca 100260 

gtgggttttt ttttttttta agacagggtc tcactctctt gtccaggctg gtatgcaggg 100320 

gtgcaatcac acctcactgt agcttcgacc tcccggactc aagcaatcct cccacctcag 100380 
cctcccaagt
tgtagagatg 
atcctcctgc 
ggccttcact 
cagtatactt 
ccttggacta 
agataagggt 
gaatagaaat 
ttttgcctga 
agtacatttc 
actctaagga 
aagggtcttt 
cagctgccca 
atatctctgg 
tattccctca 
ggtgtataat 
atccatgtta 
ctagtcttca 
ctttgaaaac 
gatttaacat 
atctctcatt 
gcaagaaaag 
catggtgcag 
cttaagtgcc 
caggtagttg 
ggaatagtta 
acccaagaac 
acatagaaac 
tcctaaccct 
gacatgatcc 
ttcatggatc 
tgcaaggctc 
ggctgctgaa 
ttatctctgt 
ccagcctggt 
ttacgtaact 
cttaaaatat 
tttactttct 
ctccctccct 
agaatgcttt 
caaatttaag 
tgacacaagg 
cttgaactcc 
ggcatgagcc 
tcatttgtgc 
tgttatgtgg 
ccaaaaaggc 
accttttata 
ggctcagaga 
tagaatagat 
aattataaat 
cattctagat 
acaagtaaat 
acaaattgag 
acaacctcat 
ctttgaaggt 
tctcaagtct 
tgtattatta 
ccaatcattg 
ttacctgaga 



agctaggacc 
gggttctcac 
cttggctccc 
aggtttttca 
tgaatcatct 
ataatcaaat 
cctttcatgt 
agaaatctta 
tatttacaag 
ttaccttcaa 
tcatagagca 
agcacccaaa 
tgtagcatct 
ctcttttgcc 
taacatattg 
tgcaaacatg 
aattgaactg 
gagcatagtt 
cacaggtttc 
tttagttaac 
ggagtctcag 
aaccagattt 
ccagggcttg 
tgccaaaaaa 
cactcctccc 
agttgcttgc 
atccatggaa 
agaatgtgta 
ggtcaactca 
attttctcag 
tatttggagc 
acagtagtat 
ggtgacctct 
tcactataag 
ggatgatggt 
gaagagttgc 
taattttgtt 
ctctccctct 
tctttccttt 
gctactttgc 
taattttgtt 
atatatatat 
tggtgtcaag 
actgtactca 
tgattaaatt 
cctggaaatg 
tatgaagata 
aggaaagcag 
ggtttaaaag 
ttccagttta 
cagttaaaca 
gtggggatac 
tcataaagaa 
atggaaggtg 
tgaaaagctg 
acaaaaataa 
ctaaggccaa 
aaaatctgca 
ttttttcttc 
ccaggtaatt 



acaggtgcac 
cacattgccc 
tgccaaagca 
ttttgttttg 
caaatttcat 
taaagtttgt 
ttttatatca 
atttgaagaa 
gtggctttgg 
catctagggc 
taagggtaag 
caaagttcta 
atttacccct 
cctctctcct 
taaaagtgct 
gaatccctat 
atgctttttt 
taagattttt 
ttttagacaa 
atttttatta 
gaccactgtg 
ggggtggcga 
agcttgggtc 
gtataacatt 
cctgccctac 
caaaagccac 
aataaatgga 
tgtagaaaca 
tagggttcat 
agttgccaag 
tcagggacaa 
ccttataaac 
ccttcatgtt 
gttctgacag 
tacagagtgc 
cctatctcag 
tttagcctag 
ctccctctct 
ttcctttttc 
aaaataaaga 
tttctatttt 
atatttagag 
tgatcctcct 
gcctttatta 
tactctcaat 
tttcctttat 
tttatgcaca 
tgctaatagt 
gtttgcctaa 
ttctaatggc 
gattcaggca 
aatgctaaga 
acaaagtagt 
atggtaagag 
atagtaagta 
gacagctttc 
tcaagagttg 
tatcaaaaat 
atatactgca 
taggaagaaa 



accaccacac 
aggctggtct 
ctgggattac 
catgtgtctc 
ttctaatatg 
tcaagtttga 
actagaaaat 
tcttcccctt 
gttgtggaga 
atcctttggg 
caggccttct 
ggggctgtac 
atgctttccc 
tgggcagctt 
catttaaagg 
atctctgtta 
gaagtaaaat 
gcccaatcct 
atacaacatt 
acattttagt 
tgaaatgggc 
aacaacttgc 
ttcttaaaag 
aacttaggac 
aaaaaaagaa 
agttataaaa 
agcttattac 
aaattattag 
ttcccaatca 
agcaattgca 
ggttgttatg 
caaagttcac 
ggttggttta 
aagcaaagtc 
tggaaacttt 
tcagcactgc 
agtcttaaat 
ccctctctcc 
ttaactatgc 
gtaatcactt 
tcattcaaaa 
acagggtctt 
gtctcagcct 
atctaaatat 
ccctatatgt 
tcattcattc 
tatacatttt 
ggttcttcta 
ggttgcacag 
aaccttaatt 
ttcatttgag 
acagatactt 
tttcaaatag 
atggtaagag 
catcctttgg 
tattgccctt 
gattttttta 
gaaaatgtct 
ttaatctgtt 
aagaggttta 



atggctaatt 
caaacttctg 
aggtgcaagc 
aggttttatt 
gacattggca 
ggaacctaaa 
aaattgtttt 
gtgaggctat 
gagttgtctg 
agaagccctt 
tatgtattca 
attgctgatg 
cactttctat 
acttgtaatt 
gcaatgcaca 
tgcaatccct 
ggtaagaaca 
ccaacccatg 
tatttcccgc 
ctacaagatg 
aatatcaggg 
tcagggttgc 
tgtggctttt 
ctgaaaggca 
aggtaaagga 
gtagcagaac 
agcccagcct 
aggagtgaaa 
ggaggttgct 
actggctttg 
cggcagattg 
tgtacactgg 
tctttgttgg 
ttggtccccg 
tttatttatt 
aaccaattaa 
actagggttt 
ctcccttcct 
agaatgcttt 
tgctttttct 
atagggttat 
gctatgttgc 
tcccaagtgc 
gataatttac 
atttcttgta 
attcccttcg 
atgattcaac 
ctaatagata 
ttaagtaaca 
attgttgttt 
tatctattgt 
tctcatacac 
tgatattttc 
acaactttgg 
gtaaagggta 
gggaggccta 
tccaacttat 
tgcatacttt 
ttcacactgc 
atggacttaa 



tttaattttt 
tgctcaaaca 
cactacatct 
tgataaaatg 
tgtctcaaat 
ttagccaatt 
gatatgggat 
actaaatggc 
atccattgag 
gtagtcacta 
tgctatcagg 
tgttaaccct 
ccctatcatt 
agaaagttta 
ccaaattgga 
gtatctctgt 
gtggcaacat 
caatggtgtg 
atttcttttt 
cttccacatt 
cttctatcta 
aaggatggta 
aaataaaata 
ttgtacagat 
acgaaggcat 
tgggtgtaat 
gtaaatatgt 
ttagtttctg 
ttgccatttt 
ttgtctttct 
tccctgacaa 
gccaggaatg 
ttggttgtgt 
ttacatgtcc 
taaaatggag 
aatagaaagg 
aaaagtttca 
tctttcttcc 
gctactttgc 
tcaagacttt 
aacatttcct 
ttaggctggc 
tgggatcaca 
ccactgagat 
tttttgttgt 
caattaactt 
tctcatggta 
agaacattga 
gagttgccat 
gacttgctct 
gtgccataga 
agctcaataa 
ttcaaaggaa 
ccagtcaggg 
gtataaggta 
taacagaatt 
ttttaattga 
gctgtaggac 
taataaagac 
agttccacat 



100440 
100500 
100560 
100620 
100680 
100740 
100800 
100860 
100920 
100980 
101040 
101100 
101160 
101220 
101280 
101340 
101400 
101460 
101520 
101580 
101640 
101700 
101760 
101820 
101880 
101940 
102000 
102060 
102120 
102180 
102240 
102300 
102360 
102420 
102480 
102540 
102600 
102660 
102720 
102780 
102840 
102900 
102960 
103020 
103080 
103140 
103200 
103260 
103320 
103380 
103440 
103500 
103560 
103620 
103680 
103740 
103800 
103860 
103920 
103980 
ggctgggtag
tggtggcagg 
tgagacttat 
cctcccacca 
atttgggtag 
tggcctcaca 
ttcagcatta 
ctgcctatga 
catgcattgg 
caggccccat 
gatcttcttt 
tggccttggg 
tcatgggctg 
atctaccatt 
ccccactggg 
cagaggttct 
ttccatacat 
acccgcagac 
catggcctga 
cactaagtcc 
gcccatgaaa 
aaggtctctg 
tcttgtcact 
atgggttttt 
tgcagaacgt 
tcaacctgtc 
cccctgacca 
aattcccact 
tgagaatgat 
ttatggctgc 
ttgatgggca 
tatgtgtgca 
atgggattgc 
tcttccacaa 
ctccacatcc 
gcgtgagata 
ttttttttta 
cttcacctac 
gtagattctg 
tgtaggttgc 
aattagatcc 
gaagtctttg 
tatggttctg 
tgaacacttt 
caaagttcca 
tgtgaccttt 
ccattcaaca 
tccaagtctc 
agactgttcc 
ctttacagca 
ctaataaaga 
agttccatat 
atgtcttaca 
tcagatctca 
gattcaatta 
tcaagatgag 
atctcttcta 
gatgaagtct 
cacatggcct 
gttattacag 



gcttcacagt 
caggggagta 
tcactgtcac 
gcttgctccc 
ggaacacagc 
tttcaaaact 
actcaaaagt 
atctgtgaaa 
gtaaatacac 
gcaagtctga 
gattccatgt 
aagctctgcc 
gcattgagtg 
ctggggtctg 
gactctgtgt 
tcatgaggtt 
cctctgaaat 
tcaacaccat 
gctgtacctt 
ctaagctgca 
tcattttttc 
acatgccttg 
tatgcaaatt 
ctttttcttc 
gcaggtttgt 
atctacatta 
gccctggaat 
tatgagtgag 
ggtttccagc 
atagtattcc 
tttgggttgg 
tgtgtcttta 
tgggtcaaat 
tggttgaact 
tctccagcat 
gtatctcatt 
tatatatttg 
tttttgatgg 
gatattaacc 
ctgttcactc 
aatttgtcaa 
cccatgccta 
ttgcatcgtc 
gctgcttaga 
cagatctcta 
actccagttc 
ggtctctagg 
taggaagttt 
aacctctgcc 
gcaccccact 
catacccgaa 
ggctggggag 
cggcaggagg 
tgagacttat 
agtcccacca 
atttggatgg 
ataatacact 
cactatgttg 
cccaaagtgc 
tgaatgatta 



catggtggaa 
tgtgcagggg 
gagaatagca 
atgatatgtg 
caaaccatat 
aattatgcct 
ccacagtcca 
tcgaaagcaa 
ccattccaaa 
aatccaatag 
ctcacattca 
cctgtggctt 
tctgcaggtt 
gaggacggta 
aggggctcca 
cctgcccctg 
ctaggtggag 
gtggaagctg 
agcccctttt 
cacagcaggg 
cttctgggcc 
gggacgtttt 
ctgcaaatgt 
ttcctttttt 
tacataggca 
ggtatttctc 
gtgatgttcc 
agcatgtggt 
ttcatccatg 
atggtgtata 
ttccaagtct 
taatagaatg 
ggtatttctg 
aatttacact 
ctgttgtttc 
gtggttttga 
ttggctgcaa 
ggttgtttgt 
ctttgtcaga 
tgatggtagt 
ttttggcttt 
tgtcctgaat 
aggctgcaaa 
aattttttcc 
gggcagggac 
ccaacaaatt 
aagttccaaa 
caaactttcc 
tgttacccag 
acccagtacc 
actgggcaat 
gcctcacaat 
caagagtgtg 
tcactatcat 
ggtccctcct 
gaacacagcc 
tgttctcatg 
cctaggctag 
tgggattaca 
agacaccttc 



gatggaggag 
aactgccctt 
caagaaaaac 
gggattatgg 
cattctgccc 
tcctaatagt 
aagtctcatc 
gttagttact 
tgggagacat 
ggcagtcatt 
gggcacattg 
tgcagggtac 
ttccagatgc 
gccctcttct 
gtcccacatt 
cagcaaactt 
gttcccaacc 
ccaaggcttg 
agccatggct 
gggccacaca 
tccaggccta 
caccattgtc 
ctgcagctgg 
attattatac 
tacatatgcc 
ctaatgctat 
cctccctgtg 
gtttggtttt 
tccctgcaaa 
tgtgccactt 
ttgctattgt 
atttataatc 
gttcaagttc 
cccaccaaca 
ctgacatttt 
tttgcatttc 
aaatgtcttc 
ttttttcttg 
tggatagatt 
tccttttgct 
tgttgccatt 
ggtattgcct 
ttttccaaac 
accggatacc 
aaaatgccac 
cctcatctcc 
ctttcctaca 
cacattttcc 
ttccaaagtt 
aatttactgt 
ttataaagaa 
catggtggaa 
catgcagagg 
gagaacagca 
aggacacatg 
aagccatatt 
tggttattct 
tctcaaactc 
gatgtgagcc 
tctccttcac 



gatcaaaggc 
tataaaacca 
ctgtccccat 
gagctacaat 
ctggctcctc 
ccctcaaagt 
tgagacaagg 
tcctagatac 
tggccaaaat 
aaaccttaaa 
atgtaagagg 
aacccccctt 
acagtgtagg 
catagctcca 
tcccttccac 
ctgcctggac 
tcaattcttg 
gggcttgcac 
ggtgcagcta 
gcagggggac 
tgatgggaag 
ttggcaatta 
cttgaacttc 
tttaagttat 
atggtagttt 
ccctcctgta 
tccatgtgtt 
ctaatcctgt 
ggacatgaac 
tttctttatc 
gaatagtgct 
ctttgggtat 
cttgaggaat 
gtgtaaaagc 
aatgattgcc 
tctaatgacc 
ttttgaaaag 
taaatttgtt 
gcaaaaattt 
gtgcagaagc 
gcttttggtg 
aggttttctt 
ttttatgctc 
ccaaatcatc 
cagtctcttt 
atctgagacc 
tcttcctttc 
tatcttcttc 
gcttccacat 
attagtctgt 
aagaggttta 
ggtggaggag 
aactgccctt 
caggaaaaac 
gggattatgg 
atatacttta 
tcttattacc 
ctgggcatga 
atcactcctg 
tttgtcccta 



atgtcttaca 
tcagatcaca 
gatttaatta 
tcaagatgaa 
ccaaatttca 
cttaactcat 
caagtcccct 
aataggggta 
aaaggggcta 
gttccaaaac 
tgttctccca 
ctggctgctt 
ctgtcagtgg 
ctaggcagtg 
actgccttag 
atccaggcat 
atttctgtgc 
cctctgaagc 
ggatacagta 
tctgggcctg 
ggctacgttg 
acattgagct 
tccccagaaa 
ggaatacatg 
gctgtatcca 
gcccccgact 
ctgattgttc 
tttagtttcc 
tcatcctttt 
cagtctatca 
gcaataaaca 
ataaccagta 
tgccacacta 
gtttttattt 
attctaactg 
agtgatgatc 
tgttcatatc 
aaagttcctt 
tctcccattc 
gctttagttt 
ttgtagtcat 
ctaaggtttt 
tgcttcctct 
ctctcaagtt 
gtatagcaag 
acctcagaag 
ttctgagccc 
tgagccctcc 
ttttgggtac 
tctcatgctg 
atggactcac 
gagcaaaacc 
tatgaaacca 
ccacccccat 
gagttacaat 
ttaactactt 
ttttttttga 
gccatcctcc 
gtctcatgtg 
ctttttctca 



104040 
104100 
104160 
104220 
104280 
104340 
104400 
104460 
104520 
104580 
104640 
104700 
104760 
104820 
104880 
104940 
105000 
105060 
105120 
105180 
105240 
105300 
105360 
105420 
105480 
105540 
105600 
105660 
105720 
105780 
105840 
105900 
105960 
106020 
106080 
106140 
106200 
106260 
106320 
106380 
106440 
106500 
106560 
106620 
106680 
106740 
106800 
106860 
106920 
106980 
107040 
107100 
107160 
107220 
107280 
107340 
107400 
107460 
107520 
107580 
gatttttttc
ttattttcct 
gcccatttat 
ataaggatct 
cagaacactt 
ggaacttgac 
aattaaacta 
aaattatact 
attcatggat 
tatttataat 
ttaaattgaa 
tatgtaagta 
attggggagt 
gtagggtaga 
cccagcacct 
gttgagttta 
ctactcatga 
tgttcttcaa 
tagttaagag 
ttaaataata 
cttaaaggta 
cccagtgctt 
aacatcaccc 
agtgggattt 
caataataaa 
ttcttttata 
tgtccttttc 
taattgcttt 
gaattatgca 
ttccttgtga 
ccaaagcagt 
aagacctcta 
atacataatg 
aagtaagtta 
aagtgaaata 
ccagagaagt 
aatgaggagt 
caacaatgta 
aaataggaat 
gtatatcaaa 
gctgcctaaa 
atgcccaaga 
ctttccctac 
gtaggctgcc 
ggtgtggtcc 
ttaaaatata 
gattgaaaat 
gagatgccat 
ctgaggcagg 
accctgtctg 
cagctactca 
ttagccaaga 
aaaataaaaa 
agctacttag 
gcaacagcga 
aatacaggct 
ggaggattgc 
tctacaaaca 
tgttactaga 
tacatagcat 



aatgatgtct 
tcagatctca 
cagttttgtg 
ttatttatct 
tcatcaaaac 
agaagaggta 
aaacaaagct 
tagccaaaaa 
tactcagggg 
ttgtgaggga 
aaacagcagt 
tgggaaaggc 
tttgtgtggg 
ctgaacactc 
agaatgtgcc 
taggctagct 
ctactttact 
tgtataataa 
actttctcag 
gaaataaaag 
tggaaatgga 
tgtgccttct 
aacaagtagg 
agaatgctgc 
ccaagtgtcc 
tgctccttac 
agcattttct 
tttctgccag 
gtgggaattg 
ggcacttctc 
gtcatactcg 
tattccaggc 
tttttaaaaa 
atgcctggga 
ctccaggcac 
cacattcata 
tatttaatgg 
aatgtactta 
ttatcacgat 
aaaataaaaa 
tggtgaacca 
ttttagcctc 
catgccaagg 
ttaatccaga 
ttttcctcct 
ggcagtcctc 
atttggggga 
ctctacaaaa 
cagatcacct 
tactaaaaat 
ggaggctgag 
tcccaccact 
taaaaataaa 
ggggttgagg 
gaccccatct 
gggctcactg 
ttgtgctcag 
acaacaacag 
gaagtttctc 
ttacattgta 



atgcttttct 
acatctgctg 
ccttaggcag 
gaccacttgt 
aagcctgttt 
ataatacaga 
gctgcttata 
taaatattta 
tggaaattat 
gaaataaatc 
agtgtatttt 
tggtgaaaac 
aagtgtccta 
cacaagggta 
tggcactcaa 
gagtttatag 
ttctaatagc 
ggtgagaaca 
gtcacacaca 
tgattttata 
tgccaagaag 
cccagctgct 
ttaacagata 
cacatattga 
ctgtcccaac 
tcctcaggtg 
cagaggacaa 
atatttcctt 
cccaggatta 
aactctgcat 
gtgcctcctt 
cttgtctttg 
tcagtaatgt 
tacgtgctac 
aaaagcacga 
aatactgaaa 
gcacagagtt 
atatagtaca 
aaaaaattaa 
tagtccccac 
aattgtgaac 
aataaggagt 
ccatctgatt 
gtacacaagc 
gaacactgac 
tacatccaag 
aaaaaaaaca 
acattaaaat 
gaggtcagga 
acaaaaatta 
gcagggaaaa 
gcactgcagc 
taaaaattag 
tggcagtgag 
caaaacaaaa 
gcttatgcct 
gagttttaga 
gagactatac 
tgtgtagagc 
ttggttatta 



tgtttttttc 
caatttgact 
tattcaccag 
ttactcattc 
tttcttttca 
agaaatctag 
gtaaggtaga 
tagataattg 
ccctgtaatg 
cacaaacatt 
tgttaggccc 
tgtggattta 
acacctcagg 
gggagagcgt 
tcttttcttt 
gctggtagag 
actttatgtt 
gttttatctc 
tatagttaat 
attatctaga 
tatagtcagt 
gcttccagaa 
tccacgcccc 
tgattgaatt 
ttgttatctc 
ctctctcagg 
ttcttagctt 
gctaggctct 
ggaatcacct 
cccttatacc 
ttctctcctg 
attttaggga 
cccactcgtt 
aacatggatg 
atactgtatg 
gttgtatggt 
tcagcttgag 
cttaaaatgg 
aaagtaagaa 
aaatttccaa 
caatgtgtag 
agagttcata 
cccagtccaa 
cttccatttt 
tgtataatta 
gttccacatc 
ataaaaaaac 
attagctgag 
gttcgagacc 
gccaggcatg 
tttcttgaac 
ctgggtgaca 
ctgagcatag 
ctgtgatcgt 
acaataaaac 
gtaatcccag 
tcagcctggg 
tttcagggac 
attgaaatat 
taagtaatct 



tagcttttct 
gttcaggtta 
cattctccta 
atggggacat 
aaatataata 
agaactgatc 
ccaagtttgt 
aatagtagtt 
taggccccaa 
tgaaaatatc 
ttatattggg 
tctacaaaat 
aaatgctaag 
cttctcttca 
ccttttttaa 
tacaaggtct 
cttcaagtgc 
catcttgcag 
gacaaggtga 
gtagtttcaa 
cttgctggag 
gaacggggtg 
tcttgaccca 
tatgaagcat 
tgcttctgtg 
gacttttcag 
cctgttgatt 
tgagccctca 
aatgtcccca 
ttcacagcaa 
aaataaaatt 
aaaaaaagaa 
acagaaagga 
aaccatgagg 
gttccgctta 
ggtttccaag 
aggaggtggt 
ttaaaatggc 
aagttactgc 
aacaacccta 
tgtttgagac 
atttgactcc 
agaagttttc 
cttatctgtc 
ccagacaaaa 
cttggattca 
actggcctgg 
cattccagca 
agactggcca 
atggcagctg 
ccgtgaagca 
gagtgaaact 
tggcatgtgc 
gccactgcac 
agaacacaga 
aactttgaga 
taacacggca 
catttctggg 
aaaaatgcag 
agagattaat 



aaccttgcat 
gtgacaattt
cttgagatga 
ttaatattta 
tactagcata 
atggagaaat 
cctgtgttcc 
tttagaaatg 
acttctaaaa 
taattttaac 
taatatagtc 
acattggtac 
gaagaaatgg 
ttgctctagc 
ctgatttcag 
actaatttct 
tttttttctt 
ggagaagagt 
ggtttaaacc 
tgtgaaataa 
taaaataatg 
tctgagtgtg 
catacatatc 
ataatatcct 
aacacatgtt 
ttcctgacct 
cctcaagcat 
gagctgttct 
ccacccccgc 
ccctgtgtac 
cctagataag 
aactacctat 
gaaataaaag 
acattacacc 
gatgaggtac 
gggaggggga 
gacagttgta 
aaattttatg 
ttgggcgaaa 
atgaggtgtt 
tgggaaactg 
aaagacattt 
tctctgctct 
ctcctaccag 
ctaaacatat 
accaaccatg 
gcagcatagt 
ctttgggagg 
acatggcgaa 
cctatagtcc 
gaggttgcag 
ctgtctcaaa 
ccatggtccc 
tccagcctag 
ttaaaaacaa 
ggccaaggtg 
agaccacatc 
gatcatagtt 
aataatcatt 
taaagtatac 



107640 
107700 
107760 
107820 
107880 
107940 
108000 
108060 
108120 
108180 
108240 
108300 
108360 
108420 
108480 
108540 
108600 
108660 
108720 
108780 
108840 
108900 
108960 
109020 
109080 
109140 
109200 
109260 
109320 
109380 
109440 
109500 
109560 
109620 
109680 
109740 
109800 
109860 
109920 
109980 
110040 
110100 
110160 
110220 
110280 
110340 
110400 
110460 
110520 
110580 
110640 
110700 
110760 
110820 
110880 
110940 
111000 
111060 
111120 
111180 
aggaggatat
tccatagata 
taatgttaat 
gagcagttgg 
gatctaggcc 
ggtgcaaatt 
catgccttcc 
agccaacatc 
tggaaggatc 
actaatggcc 
aaccaacaga 
tattgccaag 
gctttctgaa 
ataaagaaat 
tctgcaggct 
gggagctttt 
agcaagagta 
ataactcact 
atccagtcac 
tatggaaggg 
tcacgcctgt 
gtttgagacc 
gctgggcatg 
ctcttgaacc 
gggcaaccga 
taatatcatt 
aattcatgta 
aggcaatgtc 
gtcactacat 
tgtacagcat 
ctaaacatgg 
atatgtggct 
ttgattagag 
ggtgacttta 
ttttttaata 
caggagtcca 
aagtatcgga 
ctgtcttgat 
aggatctgag 
aatcaaattt 
aggctgaggt 
aaaactctgt 
cccagctact 
gtgagccgag 
aaaaaaaaaa 
aaacctttca 
acagaaatta 
tttctatttt 
atctaaatcc 
cttatctaaa 
ttaaaaaaaa 
tgttttcaaa 
ttctgccctt 
tttttcatgt 
aaatggaaag 
ctgttaggcc 
taccacagtt 
aatgttggca 
gcacagttga 
tatctatcag 



acataggtta 
cgggtatctg 
ttcatttccc 
gatagacaat 
tactcaacaa 
tgagccagca 
aaagactgag 
attagcctaa 
catttttctt 
taatttcttt 
agaaggtaga 
gtcattaaat 
tgtatccttt 
acctgaggcc 
gtataggaag 
actcatggcg 
tgggtgggga 
tatcaccaag 
ctcccaccag 
acaaacattc 
aatcgcagca 
agcacgggca 
gtggcacaca 
cgtgggcaga 
gtgaggctct 
tcaactagcc 
cagtcgtgca 
attgttgtgc 
acctaggcta 
gttactgcac 
aaaaggtaca 
catctttgac 
actccataac 
tcactagatc 
aatgtaaaac 
catttacaga 
gtcagtcagt 
gaccagcata 
aatggtaagt 
aaatttaggc 
gggcagatca 
ctctgctaaa 
tgggaggctg 
atcacaccat 
aaaatcgaat 
ttgttcagga 
tttagagcca 
aagatgaagt 
aagctggttt 
tgatgctatg 
aaaaaaacaa 
attaggttct 
tggtagctat 
cttaacagtg 
atctgctgtt 
aaaggtctgc 
tagccacaac 
agcaaagaaa 
tcagattaaa 
tggccataaa 



catgcaaata 
aggaggtgtt 
caacctccac 
atcagaagta 
attttaatct 
attagatggc 
ggaaagagat 
atcttagtga 
cacttcagaa 
gtttctttca 
accaattttg 
aagttatcaa 
tctgttttaa 
ggatgattta 
catggcccca 
aaagcagagt 
gctgccatac 
ggaatggtgc 
gtcccacctc 
aaaccatatc 
ctttgagagg 
acatggtgaa 
cctgtaatct 
ggttgcagtg 
gtctaaaaaa 
ttttgttggg 
tctcttaaca 
aaacatcata 
tatggcatag 
tgaatgctgt 
gtgaaagtac 
cagaaatgtt 
ccacacctac 
agtccaagaa 
atggaagtac 
gcctttgcaa 
catgcaggta 
gacacggtct 
ctggtgtgcc 
tgggtgtggg 
cctgaggtca 
aacacaaaat 
acgcaggaga 
tgcattccag 
gtaggtggaa 
aggaacagaa 
ctaaataatt 
ctactttaaa 
tggttggtaa 
tcatagatgg 
gtagaaataa 
ttaaaccatt 
gaaataatct 
ccatgcatga 
taaaaaaaca 
tctccacctg 
agaaatatcg 
tttttctcta 
gaagctaaga 
gttttatttc 



ctacaccatt 
gggttcagtt 
accagaactc 
tgtggaaatg 
tcattctggt 
tcttaggatt 
agataggaga 
agagaggtta 
ttaagggaaa 
aaaatcaaat 
tgaatttaca 
gttacctcag 
gaagaagctg 
taaagaaaag 
gcatctgctc 
ggaagcagca 
tcttaaccca 
taagccactt 
caacattggg 
agaagcctat 
ccgaggcagg 
accccatctc 
cagctactcg 
aactgagatt 
aagagaagcc 
tgcatttgtt 
atgaggatat 
gagtgtactt 
gtagagccta 
aggcagttgt 
aggattataa 
attatgtagc 
ctgttcattt 
taagtttaaa 
atgatacctc 
ccacttttat 
ggctgccaac 
gttgtgttca 
aagaatgaaa 
ggctcatgcc 
ggagtttgag 
tagctgggca 
attgcttgaa 
cctgggtgac 
ttaataaaaa 
caatttttga 
tcaaattacc 
cttctaaaat 
ttcctctagg 
actgtttgtt 
aggcttttaa 
agccatctct 
gcattccaga 
ttatctacac 
aacaaccaat 
gtcagctgct 
ttgggctacc 
agcacatctt 
ttacattttg 
agtgttttct 



ttatataggg 
ctccacggat 
tgaaataaga 
ataacagtgg 
aaaaacaaat 
aataaaaaaa 
ctttggcaaa 
gaagaaaggt 
aattaggaag 
ctttaagtta 
aaatacttgc 
atcccagtga 
tattaggttc 
aggtttaatt 
agttttctgg 
ggtcacttga 
atctctagtg 
gtgatggatc 
aatcacattt 
cttaggctgg 
cagatcattt 
tactaaaaat 
ggaggctgag 
ctgccactgc 
tatattaaac 
cactttggac 
gttctgagaa 
agacaaccct 
ttgctcctag 
aacacatggt 
tcttatggaa 
acatgactgt 
gcatgcacat 
gagcatgtcc 
aaaggagcat 
atcttcaaag 
catggctgag 
ttcaaacttg 
ctccaagaaa 
agtagtccca 
accagcctgg 
tggtggtgga 
tccgggaggt 
aagaccaaaa 
tttaaataaa 
taacttgtga 
taaaaatcct 
gcagggttat 
acattttact 
ttgtttgtaa 
agaattttta 
tccttctgaa 
aacttctttt 
catggaaacc 
cacccatgcc 
gaggaggggg 
aaaagatgtc 
agagatactt 
gggctggtct 
ggtgtaaagg 



gacttgatca 
accaagagac 
ataagaaaag 
aaggaaagct 
tagatttatg 
gactgaacat 
gtagcacttt 
agaattttca 
ctgaataaga 
aaatttcaat 
ttattgacac 
tttaaataat 
ttgtgttcct 
ggctcacgat 
tgaggtctcg 
tgaaattgag 
aacacaagca 
cacctccaaa 
caacatgaga 
gcacggtggc 
gaggtcagga 
acaaaaacta 
gcaggaagct 
actccagtct 
ttataaaatt 
tatttttcca 
atgcatcctt 
acatggtgta 
gctacaaacc 
atttgtgtat 
tcgctgttat 
aattccacta 
ttagccgata 
ccttttcttg 
gaatcacaaa 
tgctctacca 
ccctcaggag 
ccaggaacca 
gaaacttcag 
gcgctttggg 
gcaacatggt 
cacctgtaat 
ggaggtggca 
tgccatctca 
ctaacagatt 
aatatctggc 
agtgatttat 
ttaaactggc 
aaatcttgat 
tccagggaaa 
gagttagaaa 
ctcttctttt 
ttcccagtcc 
catcttaatg 
tcctacagtc 
caggtacctg 
agtccttaaa 
tagtctaaag 
aatcgataaa 
gattaatttt 



111240 
111300 
111360 
111420 
111480 
111540 
111600 
111660 
111720 
111780 
111840 
111900 
111960 
112020 
112080 
112140 
112200 
112260 
112320 
112380 
112440 
112500 
112560 
112620 
112680 
112740 
112800 
112860 
112920 
112980 
113040 
113100 
113160 
113220 
113280 
113340 
113400 
113460 
113520 
113580 
113640 
113700 
113760 
113820 
113880 
113940 
114000 
114060 
114120 
114180 
114240 
114300 
114360 
114420 
114480 
114540 
114600 
114660 
114720 
114780 
atgtactcaa
ggtgtgcagt 
ctctagcaaa 
aataaactgt 
agtgttaagt 
tctactccct 
tttcaaaaaa 
gttggatgga 
aagctgccag 
attggaaagc 
tagcatttat 
cctcccctga 
cacccctttc 
aatcattata 
atagttaact 
ctcagttctg 
gggcctgttt 
ttgtcagtca 
gtctccagta 
gtgtgcacca 
attaaacaaa 
caaattacac 
gcatccttgc 
gatttggggt 
cttgagtttc 
catttaactt 
tcagaacacc 
gagaggagag 
gcctgagacc 
ttcagttggc 
tccagcactt 
ctgggcaaca 
tggcatgtac 
aggaggtcga 
gtgaaactct 
ttagcaaggt 
aacataaata 
ccttgggagg 
atggtgaaac 
tgtaattcca 
gttgcagtga 
tcaaaaaaaa 
acttatcttt 
gacagtgacc 
tctcttcaac 
ttcactttct 
tttataaaac 
tgggtactac 
atactgacca 
gaagctgaag 
tgccccttct 
acttgaggtc 
aaaatacaaa 
ctgagggagg 
cactgcactc 
aataaataaa 
tttggattta 
ttttaaggat 
ccatttgaca 
ttttatgaat 



gtctatggag 
gctcatgcaa 
aacacagcta 
ctcatttcat 
tttgatttgc 
ttcagatgta 
gtatgtttag 
tattttctta 
tctctggtat 
aggaagttga 
tatcccattt 
agagtttatt 
cagttcgatg 
aacgaataca 
gcagcagcag 
cctagaagac 
ttgccctgct 
gctgttactc 
atcccccttc 
gtattttatc 
aagtactcac 
agtcttagga 
tttgcaggtg 
tcccttctgt 
taatgcatgt 
cctctgtggt 
atgtgatgtc 
aggccaaggc 
tctcaaacag 
tggtcagaaa 
tgggaggtca 
taacaggacc 
ctgtggtccc 
ggctgcagtg 
gtctcaagaa 
ttttttaatc 
aaagatgaat 
ctgaggcaag 
cccgtctcta 
gctactcagg 
gctgagatca 
aaaaaaaaaa 
aaagttaact 
tctagtataa 
attttcatat 
gctgaactaa 
agcagcagtc 
tgtctcagtg 
aagaaacttt 
ttcttgcttt 
aggccaggcg 
aggagttttg 
aattagctgg 
agaatcgctt 
cagcctgggc 
taaaataatg 
cccaatttat 
tacaaagtag 
gacaggtgtt 
actaaagaat 



tttgttttct 
aacttcctaa 
gaggaaacac
tcagaagatt 
aacaaactca 
cgtctatttt 
gcagaatggt 
gaattactcc 
gaatagcagt 
aagaatcagt 
tttgatgtac 
ctcctagaca 
atgtgtatct 
gatgatgact 
tggatgataa 
agttgtagtt 
ttcctggcct 
caagtcctct 
tttttaaact 
atatttattt 
ccccttaaaa 
aacagtacag 
tgtaccacca 
tgagcattct 
tcgttggctt 
tgatatatga 
tccaaatgtc 
tgaggccatt 
gatctgtctg 
cagttaccca 
agggtggagg 
atgtcactac 
aggtactcgg 
agctgtgatt 
gaaaaaatgt 
ccctggaagt 
tcctggccga 
tggatcatga 
ctaaagatac 
aggctgaggc 
tgccactgca 
agatgaattc 
ttaaacaact 
gctttaaatt 
ctctacaact 
agtgcatgcc 
atggacagcc 
agaaaggatc 
tttagtttgc 
atttaaataa 
tggtggctca 
agaccagtct 
gcgtggtggt 
gaaccaggta 
gacagcgaaa 
ccccttctag 
cttctttcaa 
agcagggggg 
cttagtttca 
gcctcttatt 



tccaccttta 
gggagtgccc 
aaatcgttta 
tctgttcatc 
gattatccat 
tttaaagggc 
cttctagagt 
aaaatgcagt 
gagtaaaagt 
ttgaattcca 
catgtaaatg 
taaaatgagt 
gatgaggaag 
gggtgactga 
ctactgggtg 
attttagtcc 
ctgtatacgg 
gtcagaaata 
atgactcccc 
aataaattcc 
ggaggaagat 
taggtaatta 
ctgcacttca 
tctatcgtga 
tattggagct 
gagctagagg 
ttacactgtg 
ccagtgagat 
tgggctctag 
cggccgggtg 
attgcttaag 
aaaaatgatt 
gaggctgagg 
gcgccagtgc 
ttcaggcaca 
gtgcaaccca 
gcgcggtggc 
ggtcaggagt 
aaaaataagc 

aggggaattg 

ctccagcctg 
ctgacactta 
gctctacagc 
ggtttaaaaa 
ctttattaga 
agtctccaga 
taggccacaa 
cagagatcca 
agattttggt 
tcatggcagt 
tgcactttag 
ggccaacttg 
gcacacctgt 
ggtggaggtt 
ccccatctca 
ttgaaaaact 
aatacctcaa 
aaataataaa 
taaaaaaaaa 
tttgtttgtt 



atcacctcac 

cacttgtttt 

aaaaagaaag 

ctacttttaa 

taacccacgc 

ccaggctttt 

tatggaatag 

aaatgagaag 

gctttgaact 

taattgctgt 

taaaacatac 

gtacatgcct 

gacagagatg 

aatcttatct 

ggggtagagg 

cacagtctct 

cctttgttag 

aagttacttt 

aaagatatat 

ataaacctta 

aatgaccatc 

ttggaccttt 

caggtttctg 

aaccatttgg 

gcttttgtac 

actctgcctt 

gctcccctgt 

gatcactttc 

tgaggggaaa 

tggtggctca 

cccagcagtt 

taaaaattag 

taggaggatt 

actcaagcct 

gtcagtgttg 

ggattaaaca 

tcacacctgt 

tcgagaccag 

cgggcgtggt 

cttgaacccg 

ggtgacagag 

atacatttag 

ataatattat 

atagattaag 

aatatccatt 

ggatgacaga 

gtagaacata 

gcatcaccag 

gggaataggg 

aacaaaattc 

gaagccgaag 

gtgaaacttc 

aatcccagcc 

gcagtgagct 

aataaataaa 

aagttcctac 

acatttcacc 

ccactaataa 

gatgcctggt 

ggagacaggt 



tcggcttgct 
aatttacttc 
aaaaaaacaa 
gtcaaagggt 
ttgcaggctt 
tgagtttacc 
cttttcaaat 
caacaagctg 
tagtgtccgt 
aaatattatg 
aggtcccctc 
agccattgtc 
agagaatcaa 
cccaaagagc 
ggttgttttg 
actcctccct 
cactttgtag 
tgtttgaaga 
agtctaactt 
cataaaagat 
aaggcatggc 
ttgattaatg 
taaataaagt 
tgatgagaag 
tgtggccacc 
agcttcttca 
aagagaggtg 
ttaggctact 
gtaaccggcc 
tgcctgtaat 
tgagaccagc 
ctggatgtgg 
gtttaagccc 
gggcaacaga 
aagatgtcaa 
gatgtttaag 
aatcccagca 
cctgaccaac 
ggcgggcacc 
ggaggcggag 
tgagactcca 
ataaagtcta 
cccctgtgta 
tatgaaaaca 
ttgtcctata 
tgggcagaaa 
ccagacctcc 
attccctcac 
aggagtatta 
ggttaaataa 
caggcagatc 
atctctacta 
actctggagg 
gagatcacac 
taaataaata 
ctaagcataa 
ttattattct 
agaataatag 
agattcagtc 
tcattttgaa 
cctaacctgg
aagcaagaag 
actaaaatac 
atcatcagga 
gctgtcagtt 
caggcaattt 
tgaaacagga 
gacattaaca 
caataacaag 
aagaacatgt 
ggattagggc 
ttgtagagct 
aatccggatt 
ttggtagacc 
gttttactta 
aatccaggct 
aaggtacaaa 
gtaaatgtag 
ggaggattat 
gaaagagaag 
atgagaagtg 
tatgggtggt 
gtggtgtggg 
ggctatctat 
aaggtccttt 
aacactgttc 
gtaggtactt 
agttatccaa 
cctctccaga 
ctaatgctac 
tctgttataa 
ctccagaatt 
gaccttgaga 
gtagcttata 
cttttttttt 
caatcagctc 
cgagtagctg 
gagatgaggt 
cccacctcga 
aaatgctttt 
ttatcacagg 
tgcagagaag 
tcagtgtggt 
tgaaaaaaat 
gctctttggc 
tttttacacc 
tacttacatg 
tcttattgcc 
cgggttcaag 
accacaccag 
ggctggtctt 
aaattcttct 
aaggacaaaa 
atttctttgt 
ctatgaagaa 
atgtcttatg 
ctcgtgtttt 
agaacaggta 
gtgaaacatg 
gtgttgcagc 



tgttctcagg 
catgttagaa 
catgaggaac 
ttcagtggtg 
tctgccatgg 
gtgagggtga 
gtggatatgt 
gggctgggta 
aagttaggaa 
tggcatgttt 
tgagatcaaa 
ggatcacaca 
tctgactttt 
tatatgataa 
tgttatttcg 
tataagtcag 
gatagattag 
gcagtgggcc 
tctggaccag 
agaaaaagga 
ttttaaagag 
tgtatttggt 
ggacggtgat 
ggcccacagg 
aaaaaaaaga 
taagctcttt 
ttccaatttc 
agccacaagg 
ggtttatctt 
acagataggc 
ctcgtttcag 
gacccttgaa 
gaacaggtat 
attgcatgct 
tttttggaga 
actgcaacct 
ggattacagg 
ttcaccacgt 
cctcccaaaa 
aaaaatgaaa 
acaagtttca 
caaatgcatt 
ttctttctct 
aaagaggttg 
cggattgcag 
ttttttcccc 
cttaaattct 
caggctggag 
ggattctcct 
gctaattttt 
gaactcctga 
ataaaaatga 
ttagttatta 
tttgtcgtgg 
tgttttttgg 
aaatttggtt 
ggggtgagca 
ttgaagtgac 
gtctgtgccc 
acagtggaat 



gaacataggt 
gactgctgta 
gtgggccaaa 
actcttggat 
gtagttgggc 
gttgagttca 
ctaattttac 
ggaacacgga 
gttgaagagg 
aagtgacatt 
gcaggcctaa 
gtgtcctaaa 
cttttaaact 
taatcgactg 
ctgagccctt 
atgaattttc 
caacagatta 
acctttgaat 
tatgaaaaca 
gatggtgttg 
atatcaccca 
gggaaatttg 
tgtcaggcat 
tcactgtagt 
aagctaaaat 
acacagacat 
cattttacag 
ctgataagaa 
ttaactgcta 
aactttacag 
gggtgtcaat 
ctgtctctta 
gttcaaatgt 
tctaaccctt 
cggagtcttg 
ctgcctcccg 
catgcgtcac 
tggccaggct 
tgctgggatt 
actaaaacat 
taaatattga 
tgatgcaaag 
ttggcctata 
ttctcatgtg 
attatgtgcc 
tatcaaaaat 
ttttttttct 
tgcaatggca 
gccttagcct 
gtattttttt 
cctcaggtga 
gaaatatttt 
gtttgggttt 
aactccacca 
tgtagttctc 
gtgtgtgtga 
taaacaaatg 
tccttcatgc 
tcacctagta 
aacgggtttg 



tagaggggag 
gtggtcccgg 
ggaaacaagt 
ttgaagaggg 
taacagtagt 
gcacagggcc 
atgaagtgtc 
atacctcagg 
gtagaagagg 
gtgaataaac 
tgcagtttgc 
tttaaattag 
aaagggctcc 
aagtatatgt 
agacatttaa 
tacttctgag 
cggagagctt 
aaggaattga 
cagaagttag 
ggatacataa 
gaaagtctat 
cttgcagact 
tatggaaagg 
ctctgttata 
taatacctga 
tttatttaat 
acaaagaaac 
cagaaccagg 
tgttaaactg 
gtagaggacc 
ttggtccagg 
gggagcagat 
ctgttctctt 
tgttgaaaaa 
ctctgtcgcc 
ggttcaagct 
cacgcctggc 
ggtctcgaac 
acaggcgtga 
gttaattttt 
aatttggaaa 
cctcaaattt 
gatgaaacat 
cattggcgtc 
tggtgtgaag 
tactctcatc 
ttcctttttc 
cgatctcagc 
cccaggtagc 
agtagagatg 
tctgcccgcc 
ctacaacata 
taaacatgag 
aaccagaatg 
atagtcatgt 
cttttcagct 
ctaattccaa 
cacacactct 
gctaatggtc 
ctagatgagt 



agatttagga 
tgaggagtga 
ctgagagatt 
aaagggaata 
gtattaactg 
tgttgacttt 
tacttaagag 
tgccaaatga 
cagcagaaat 
ctgtcctact 
agacgtgttt 
gtgccaacat 
tagaggtaga 
cctttctctc 
attttgtact 
tcaaagatca 
tgaataccaa 
tgagattaaa 
ggaaaacagt 
atgggctttt 
gcactgccac 
tccagaactc 
tcaaacaaaa 
agtacactaa 
ttgttattaa 
cctcgcaacc 
tgaaacccta 
acttgaacgc 
cccctgcatt 
ttatgcttta 
tcctcctgga 
aatgtaacgg 
cctttagcta 
taaaaactct 
caggctggag 
attctcctgc 
taatttttat 
tcctggcctc 
gccaccatgc 
ttcaaatgtt 
aagttgcaag 
gtcaagtttt 
gtaaatgaaa 
acttcaggag 
aaggtttatc 
acccaatgtc 
tttttgagat 
tctccgcaac 
tgggattaca 
gggtttctcc 
tcggcctccc 
acttctatag 
aaattggcaa 
gttttcatcc 
gcagatcctg 
tctttactgc 
gatcattgct 
gctaaatgct 
tcatgggaga 
gccatagcaa 



gtgagggctc 
aaggaatgga 
tagaaagtaa 
atctagggta 
aaatgggggg 
gaggtgcctt 

aatgtgtaga 
aaacttttgg 
gtagatggat 
gggccagatg 
tgttagactt 
ttccacacaa 
tttggcaaca 
caatgaacct 
tttttttttt 
gtaggtaata 
tctaaggagt 
gccatattta 
taatagtttt 
aaaatgcaaa 
atgggcacta 
agaccaatgt 
tatgctcact 
gtggaggaga 
ctgtgtgcca 
aatttctgaa 
gaggttaaga 
aagcagtctg 
ttaatctgtt 
ttctggatgc 
agaaataaaa 
gtccttgggt 
atggatcagt 
tataaacatg 
tgcagttgtg 
ctcagcctcc 
atttttagta 
aggtgatcca 
ccggccttaa 
ttcatgaaaa 
cctataacat 
tctaccatat 
gatttcaaga 
ttggacgact 
tacaggtact 
tcattaaatg 
ggagtttcgc 
ctccacctcc 

ggcgtgtgcc 
atgttggtca 
aaagtggctt 
gcagtttttc 
tgaaacaaca 
attgcttttt 
tgccctttgc 
aaattgcctc 
gacaatcaac 
gaagacttaa 
agataaagca 
cacaggcacc 
atgcatctga
gataagggag 
cactgtggag 
gggagaatag 
tatgtacatg 
agtgacataa 
ctccgtttcc 
cgggctcaag 
accatgcctg 
gttgcccagg 
agtgctgggt 
ttcggctgtg 
agatcacctg 
tacaaaaatt 



agcaggagaa 
gcactccagc 
ggtaactgtg 
ggccacatac 
tccatagaga 
gtaactgcta 
aaactcactg 
ataaagctat 
ggaggctgaa 
gacctcatct 
cctatagtct 
attacagggg 
tctctaaaat 
tagtctaggc 
ctagggacat 
ctcaggtaat 
gtatgatgat 
cttgccccag 
gtagtgttta 
ggttaggaag 
agatggaaag 
gcaggagtgg 
tttagaatag 
tgatcatgat 
gaaggaaggt 
aattatatta 
gacacaggaa 
gaagtaataa 
ctctacatgc 
ttatttccac 
tgtgtgtata 
aagagagttg 
agtacttcac 
ccctctaatg 
tgcctcctgt 
atggtccctg 
ccaagaagat 
acagagcctt 
gcgcagtggt 
ggtcaggaga 
ggcttggtgg 
tgaatctggg 
tgacagagtg 
cgtaggtctg 
cagggaggga 
ccaggtgaga 



gtcactagtg 
acagagaagc 
gtgtgacatg 
tggatggatt 
agctctatcc 
attggggtgg 
caggctggag 
caatcctccc 
gctaattttt 
ctggtctcga 
ttacaggtgt 
cgtggtggct 
aggccaagag 
atccaggcat 
tcacttgaac 
ctgggtgaca 
aagagtggaa 
tgaatgattc 
aagaaggtag 
ctgtgtttct 
aattgtgcac 
ttttaaaaag 
gcaggaggat 
ctaccaaaaa 
tagctatgca 
agtatggtca 
ttagagaaaa 
atctaggcag 
agagaatgag 
ggagtagatg 
gaattggggt 
tgatgagttt 
gagctcatgg 
cctgtgggtt 
ggagtggcca 
tcagcagtgt 
ggaggctatg 
catttgaaca 
aaaggcaagc 
tacatgagag 
aatgattgga 
cagtcaacca 
attatctcat 
ttagagatgt 
gaggtccagg 
gtaatgtgct 
tgtggacatc 
gacatttatt 
gccccaagca 
agttcctaaa 
ggggaaagtg 
gaaagccagg 
tcacgcctgt 
tcgaaacaca 
cacgcgtctg 
aggcagaggt 
agactccgtc 
attttgtttt 
aacaagagtg 
gatgatggac 



aagggctgag 

atgctccgag 

ggatgttata 

ggttgggggg 

tgcaatctac 

cgggggttgg 

tacagtggtg 

acctcaaact 

gtttttttgt 

actcctggac 

gagccatgcc 

catgcctgta 

tttgagatta 

ggtggcacat 

ctgagaggca 

gagcaagact 

tggagtggag 

catttatatg 

atttgtggtt 

ttgtggggta 

tttaagtggg 

gccaggcatg 

cacttgaggc 

aaaaaaattt 

ggaggctgag 

tgccactgta 

agagtagagg 

gaagaaccaa 

aatttgatga 

attttgccat 

tttttaaatt 

gtttttgcca 

aagagtctgg 

tggcattcat 

gataggcagg 

cagtagctag 

agtgagctta 

aatggaaata 

cacaaggggg 

gcagtatttg 

aaaggcatta 

tttattgagg 

tgatccttgc 

gaacactgag 

cagtcacagg 

gcttttaaaa 

ctcatctaac 

gcactacaga 

ccagctcaga 

gcaagcctga 

gcatgggaag 

atgaagagac 

aaccccagca 

gtgaaacccc 

tagtcccagc 

tgcagtgagc 

tcaaaaaaaa 

taaagtcact 

caagagggga 

tatggtgctg 



cagagttaag 

cagagacaaa 

tctgggaaac 

atggacagga 

taggagctat 

tgttttttgg 

ccatcacagc 

cctgagtaga 

tttttttctg 

tccaacgatc 

cagcctgagt 

atcccagcac 

gacatggtga 

tcctgtaatc 

gaggttacag 

ccatctcaga 

gaggggacaa 

aagtgtccag 

gtaggggcta 

atgaaaatgt 

tgaatttcat 

gcagctcatg 

cagttcgaga 

ttttaagtta 

gtgggaggac 

ctccagccta 

tccagggact 

ggcagagata 

agccacttag 

cttatcaaga 

aaaaaaaaat 

tgttgtttgt 

ataagaggcc 

gggctgcatg 

gggaaaacag 

tgacaaagag 

cagagaatgg 

tagacaactt 

ggtcttaggt 

agcagggcct 

cagataaagt 

actttcatgt 

cccagccctt 

gaactgagag 

acagtctggt 

atgtatttat 

atttagtttt 

ctttcagctt 

tgctgttagg 

ggcaggaaga 

ggactggaaa 

tttgctttaa 

ctttgagagg 

atctctacta 

tactggggag 

cgagatcgca 

aaaaaaaagt 

gtggttgttc 

ggaagacctg 

taatagataa 



aggtgaggac 

cacatgggaa 

aaatgtttga 

agcagcttgg 

taaaggattt 

ttttttgagg 

tcactgcagc 

tgggactaca 

tagtaacagg 

cacccatttc 

tgtgttttat 

tttgggaggc 

aaccccgtct 

ccagctactc 

tgagctgaga 

gaaaaaaaaa 

gagttgagat 

aataggaatc 

gggaaaggag 

tataaaatta 

agaatatgaa 

cccataatat 

caatcctgga 

gccaggcacg 

tttgagccta 

ggtgacagag 

agttggaaac 

ctgagatcaa 

gctggatctg 

tcaatatgca 

tttcacccaa 

gatgcctgta 

tgtgggttaa 

aagtaagtgg 

ggatgagagt 

gctgaaaggt 

tttctacagg 

tgtccaaatg 

tttgaggatt 

tgaaaaagag 

taacttccat 

gccagacaat 

taagagagaa 

gctacctcac 

cacacagcta 

tgtatcatga 

gttctctagt 

tcatacattc 

gtgatgcaaa 

agccgaatgt 

gttagagtgg 

gagtagtgga 

ccaaggctgg 

taattacaaa 

gctgaggcag 

ctactgcact 

agtggacctt 

tgtgtttgga 

ttaggagatc 

agtcatgata 



caagtgtgta 

acacctccca 

atggaataaa 

gaagggcggt 

taagccagag 

caaggtgtca 

ctcaacatcc 

ggtgcgtgcc 

gtttcgctgt 

agcctcccaa 

aaagatcaat 

tgaggcaggt 

ctactaaaaa 

agcaggctga 

ttgtgccact 

aaatcaatct 

agtcacaaaa 

tctgtctctt 

ggaatgggga 

gatagtgatg 

ttatatctca 

caacacttta 

accatagtga 

gtggcacatg 

ggactttgag 

tgacaccatg 

tatattaaag 

atggaataag 

agatttgtgg 

gtagatctaa 

cctacctgcc 

ggaaagccaa 

gaggcctgtg 

aggactaagg 

gtttctaggg 

tttcattgaa 

tgcagaaaat 

cttggctgtg 

aagcctgtct 

gtgaaatttg 

taattgactt 

gaactaaggg 

ggtaccattg 

tgtgggtgtc 

gaaggaagtg 

tcactttgtc 

gtcaagggag 

aaaaattgag 

caagacagac 

gtgtggaaac 

gtcagattag 

ttttggccgg 

cggatcacga 

aaattagcca 

gagagtcgct 

ccagcctggg 

tgttggagga 

ggatggatta 

aattcagggt 

tattttatat 
atcagcattt
attatctgca 
agaaaatata 
aaaagttagt 
ttctgttagg 
tagcacatgc 
gaaagccctt 
cacagcggca 
attcactacc 
agtgtggttt 
ttccttcctt 
atgctcctgg 
tccctgcttt 
gaagccagtt 
ccctcccgca 
caagagcaca 
atcagcagag 
actggatgat 
gcagatcctg 
gtctttgctt 
ctgaatttaa 
atattgagtg 
gcagaaggat 
atctctacaa 
acttgggaga 
gtgatcatgc 
aaaaaataaa 
tgcattacgt 
ctttattggg 
tataggcctg 
tataagtgtt 
aagaattgaa 
ggacatattg 
acacatgtgc 
gaggaggtga 
tactgtgtac 
aatgtagaaa 
caggtagaat 
taaagatatc 
aaatccaaat 
ctcatctaga 
aagtgagagt 
gtgacactgg 
ttcctttagt 
cctagccaga 
ttacatgtct 
agctacactt 
tgagatagaa 
aactcataga 
tgatatattt 
taagtgtcag 
taaccagagc 
ttcatctttc 
ttattctggg 
atgtgctcct 
tcacctactt 
atcactacag 
aagaatagat 
atttaaataa 
gaatggagaa 



tattccaaaa 
acatttttgg 
gcctttccta 
attgggggtc 
ccttatctaa 
tgagtcacag 
cagtgtgtcc 
caggacgcaa 
tcctctctct 
ttgattcttg 
tttctccatc 
gcttaccttc 
catctttcaa 
ctgagtagca 
tggtctagtt 
ggctctggag 
ggctgtggtg 
taataaatcc 
atttggaata 
ataggtaaca 
acttccatca 
gttggttgca 
cccttgaccc 
aaaattaaaa 
ttgagatggg 
cactgcatac 
aaaaaatgct 
agaataataa 
acaggagaag 
tgtttctagt 
ttggaagggt 
aacttggaat 
aagcatggac 
acaggaacgc 
atgggcatgg 
atttctctaa 
tattaatgac 
gaattggatc 
ttgggatgag 
acttaacaac 
ctgtggcact 
aaccatgtgc 
cctctgtgtg 
ttgtgatggg 
ggttctgaaa 
atctccaaga 
aagattcttg 
atgggaaaag 
aggttaatac 
ccttaaagaa 
tgccagagcc 
tgctgtgttt 
acactgagtc 
gagtgatgaa 
tgtggcccag 
acctccacat 
tagctacaaa 
gtgtaaattt 
ctacataaat 
aaggacaact 



caatgtttga 
agcacatttt 
acttgtccca 
aaatatttca 
tttttgcatc 
gtgtgacagc 
tgtggttgcg 
cacaggccta 
cctttttcat 
gctcttctct 
cctggctctc 
tttccattca 
cttcttgggg 
gccaggccta 
tcagatcatg 
tgagaagcca 
aggattcttg 
ttgcatctgt 
ggtctcaaga 
aagaatttat 
aggtcctggg 
gtagcttatg 
caggaatgca 
agttagctgg
aagattgctt 
cagtctgggt 
gagtcaagtc 
tggttaaaga 
ctacgtgtgt 
caccattaaa 
gagaatattt 
ttagatcact 
atcatgtcca 
tctggaagga 
gttgagagtg 
ggtttgaatt 
tttcctatcc 
agtgttgctc 
ggtatatact 
ttcccagcct 
gggtgtgaac 
catggagaag 
cttcagaaga 
ccagcagaac 
gtctggagac 
tgccttcttt 
agcaaaacct 
taagagcaaa 
ttctatagcc 
aacaaaagca 
taggagaaaa 
gagagcaagt 
ccagcatcac 
gcattgaacc 
gcatccctgc 
ttagccagat 
taaggtggaa 
ctataaacac 
cattgcctaa 
tgcagtcagg 



acgtcctcct 
aatacctgac 
tgttggtctg 
ctttagatga 
caaatgtaga 
tgcatttcaa 
aggagccact 
acccatgcag 
ggcttccttg 
gctgcagttg 
tgccttttgg 
ttggtggtgt 
ctgtgacaac 
gaacactggt 
gtaatcccta 
gttcacccca 
gtggcaggtt 
tatgttttaa 
aaggagacta 
gaatttatct 
tcccaggtgt 
cctataatcc 
agaccagcct 
acatggtggc 
gagctgggga 
ggcagagtaa 
tactgctcct 
aaagctttgc 
cgtggtatct 
ttactacaga 
caaaggtttg 
tttccattgt 
tgcacataaa 
tgcaaaccga 
agaagaggtt 
ttaaaaaata 
atcagtctgc 
cattttccat 
ttatcaaatg 
aagtaaatga 
gtatcaaatg 
gttgacagac 
atccagccac 
tctgttgcca 
tttatattgg 
tattcaggtg 
aaactgacat 
aggaatcttt 
tgtactatta 
atatagacat 
taaggagcct 
tttaaaagat 
cagatcagta 
tgctatatgt 
atataaggaa 
tctgggtatt 
gaagcaaagt 
aagtcttaaa 
ttttaataag 
gaaagtatta 



accctagata 
tgtttccagt 
cagttacaca 
aagtttagcc 
gcatcgtttg 
acaagcctga 
cacggactcc 
gatgctggac 
ccccaacatc 
actccagctc 
cccatccttt 
gttttctcag 
tcttccctct 
gacacagaca 
gtctaggagg 
gttctaccac 
atgtaaagtg 
tatcttatta 
gggtctaatc 
cacatttttg 
tttcaaccta 
cagcgctttg 
gggcaacata 
ctacacctgt 
ggttgaggct 
gactttgtct 
gccagaagag 
aaaagttccc 
ttttactatt 
tttgtgtttt 
aaagttaaaa 
gtcatatttc 
gcagacaacc 
actgttaagt 
gtaggaagac 
tattcatgag 
ctaagcttcc 
tttagcattt 
taagctattt 
tcagaggctc 
catgtttctc 
tccctgtgaa 
ctgctgtgtg 
actgttttct 
ctaaacttta 
cagctcattg 
ttctccagca 
tgttctcatg 
taacaagtat 
ctaaactgtc 
gtgaattcct 
tgtatgtaat 
tttgatgcct 
atagtttgcc 
taggtacgtt 
aacatctgct 
atttttctga 
ggaaaatgaa 
aatataactt 
agaaataata 



gggcagacag 
aatttacaaa 
gcagtaagtt 
acaatctggc 
tggcacccag 
gaaggagaaa 
accttgtgaa 
tcgttccctt 
ccaaacacac 
tgcctgtttg 
aaggcttgga 
acatacctgc 
ttgtcccctg 
cacttcatag 
ctgccgagcc 
caacttgaga 
cctagcccag 
aatactgaaa 
ctgaattaga 
cttaagagtt 
ataattctaa 
ggaggccaag 
gtgagacctc 
agtcccagct 
gcagtgagct 
caaaaaaata 
atgactgaag 
agaatatata 
ttcttaatct 
taatgtaata 
ctgtgtatga 
tctgtgacat 
cagacaacac 
gttgattcct 
tttcatatat 
ttacttttgt 
tcttccggtt 
tacatttgcc 
ccaaagtagt 
cgtactcagt 
catcaggcag 
gcacttcgaa 
gcctgacatt 
gtcctgggtg 
ggaacgtcaa 
tttcctcttg 
atgctctcct 
tgcatacact 
tatatattta 
actggcttat 
tactcgaatc 
actaagttta 
ggatcaatct 
gagcgtcggc 
ctcacgagcc 
gggaaagagc 
gaagtactta 
aaaattttac 
aatatagctt 
tgctcagtct 
gggcgcggtg
gaggtcaaga 
acaaaaaagt 
ggcaggaaaa 
gcactccagc 
aagaaatatt 
tacaaaactg 
aaattgttct 
tgtactaggc 
taaagaaatt 
tactgaccaa 
aaaaccttgt 
aaatccattg 
gtgattatag 
tgtacataga 
cgcctataat 
caagaatggc 
ggcatagtgg 
ggaggttgag 
tgagacctgt 
actacttaaa 
ttaaatctct 
gtcatagaga 
aggctgtagg 
tgcattggag 
ttcttttgaa 
ttttcttttc 
ggggcgatct 
gcctccgagt 
tttttgtatt 
tgacctcgtg 
cgcgcccagc 
tttagtgatt 
acccaatatg 
tgttaaagtt 
cagaggccct 
aggtccgtag 
tacagctgct 
tccaggcaca 
ctccatcttc 
aatggagata 
caatctaaag 
ctattatatg 
gaattgttca 
ctgtcgccca 
gattcacgcc 
cacgcccggc 
atggtctcga 
acaggcatga 
aaaaaagtta 
aactacggca 
taacccaagg 
gcctaccaaa 
tgaaagtgtt 
atctataatc 
ttaatgaatt 
ctagtccaaa 
gggagggaaa 
aatcccagca 
gcctgcccaa 



gctcatgcct 

gatcaagacc 

agccaggcgt 

tgttgtgaac 

ctgggtgaca 

atgctcaaaa 

accaaagaac 

aaaacaacat 

ttccaacact 

tgttggcaag 

aatagaaaaa 

ataattacag 

aatctggccc 

atatgtttct 

ataaaaatgt 

cccagcactt 

ctgggcaaca 

catgtcccag 

gctgcagtga 

ttcaaaaata 

tctttacctg 

ttttgtctca 

gcttagaacc 

cccaacaatt 

aaaaggatat 

atgctagtgt 

tttttttttt 

cggctcactg 

aactgggact 

tttagtagag 

atccgccctc 

cggccaactc 

gaagaaaaaa 

aaacagtttt 

gtgaaaacca 

cagattgtga 

gtgtgctagt 

ttatctggtc 

gttctgttca 

tgtgctttgg 

acattagtac 

tgcttacaac 

catactttta 

gtttttgttg 

ggctggagtg 

attctcctgc 

taattttttt 

tctcccgacc 

gccaccgtgc 

ttaaattgac 

gcatataaac 

gtaggaacga 

tagtaggcaa 

caatggctca 

tctgccagtg 

tgagctgatt 

gatgaatttt 

ttaaatatat 

ctttgggagg 

catggggaag 



gtaatcccag 
atcctggcta 
ggtagtgggc 
ctgggaggca 
gagcaagact 
tatatagcaa 
aaaaacgtga 
tagcacttaa 
aactaagtat 
tggtactggt 
aaatatgaag 
tggcatggtt 
cgtaccatcc 
aagatttaag 
ttctattgtt 
tgggaggcca 
tagtaagacc 
ctacttggga 
gctatgatcg 
aaaaattggg 
cttgttaatt 
ataaatggga 
tggtagtgtc 
actcaccact 
ttaactggca 
gaagtgctag 
gagagggagt 
caagctccgc 
acaggcgccc 
acggggtttc 
ctcagcctcc 
gtattcctaa 
tgtaccgaac 
caggagcatg 
acactcctga 
gtggctgccc 
agggcctagg 
tctccttcct 
cagcctgctg 
gcacattaac 
cacctcataa 
aatgcttggc 
aaaaacctga 
ttgttgttgt 
cagtggcatg 
cttagcctcc 
gtatttttag 
tcatgatctg 
ccggccatga 
tgtttcctcc 
acttcacttg 
aatcttattc 
aatcaggtgt 
aaataatata 
aagtatacat 
gagcactgac 
ttttttggta 
aaaaagtcat 
ccgggcaggt 
ccccatctct 



cactttggga 

acacagtgaa 

gcctatagtc 

gatcttgcag 

ccatctcaaa 

taagttggaa 

agaggccaag 

caaagttgaa 

aaaattccac 

gttaaaaaaa 

cacattaaaa 

gggaatgttt 

taaagtttta 

taactttcca 

aagaatatta 

aagcaagtag 

ccatctctac 

gactaaagtg 

caccactgca 

gtttatctac 

tcaaaccctt 

agtatcagga 

cctctgagcc 

aagaaattat 

aacaaaagtc 

gcttattttt 

ctcacactgt 

ctgctgggtt 

accaccacgc 

accgtgttag 

caaactgctg 

acgaatcata 

tgtatgatat 

tttcctattt 

attccaccca 

cgaggtgtac 

aaaaacagaa 

ccctagcctg 

ctgcccccgt 

ctctcccagc 

agttgttatg 

acttggtaaa 

tgcttttaaa 

tgttgttgtt 

atctcggctc 

cgagtagctg 

taaagacggg 

cccacctcgg 

atggttcagt 

ttttttgtac 

cacttattta 

attgttgaaa 

tcaattctat 

ataagcctta 

agttttaaag 

cagatacagg 

cagattctga 

tttcggccag 

ggatcacttg 

actaaaaata 



ggccgaggca 

gccccatctc 

ccagctactc 

tgagtcgaga 

aaaataaaaa 

acttttactt 

ttccaagact 

aagttaacaa 

agagctggtt 

aaaaaaaaaa 

gaaaaaaata 

ggtctatagt 

ttctagattc 

tgtttatctc 

gagttggacg 

tttgtttgag 

aaaaaataaa 

ggaggatcac 

ttccagcctg 

ttagattttc 

ttctacattt 

agtcttttta 

ccagttcttt 

gcttgtgctg 

aggagaatgg 

caaatgccca 

cgcccaggct 

gacgccattc 

ccggctaact 

ccaggatgat 

ggattacagg 

attttaccat 

gatggtgtca 

tggtgtcagt 

gaactcacac 

taccaacctc 

cagatgagga 

actgctattg 

gggccatggt 

gtcactatct 

aggatcacca 

cactaaataa 

aattttttct 

gtttgagacg 

actgcaagct 

ggattacagg 

gtttcacctt 

cctcccaaag 

tttaacaggt 

ccatcatact 

tttaaatgtc 

cctctagcac 

taactactct 

actctggggt 

gttaaaaaaa 

atccttaaac 

cttcaaggga 

gcgctgtggc 

aggtcaggag 

caaaaattag 



ggcagatcac 

tactaaaaat 

gggaggctga 

tcgcgccact 

aaataaaaaa 

gaataatttt 

catgttatgt 

agccaagtac 

ttcttatctt 

aggaaatatg 

tatattcctg 

tttaacaatt 

tctggagttt 

cttttatgtt 

cagtggctca 

cccgggagtt 

aaattagccg 

tttgagccca 

ggcaacaaag 

aataaaaatt 

tgatttatct 

cttgctcaag 

ccaacctgcc 

tcatggcagt 

ggagattttg 

actcgtattc 

ggagtgcagt 

tcctgcctca 

tttttttttt 

cttgatctcc 

cgtgagccac 

aagaccatag 

aaaagaacta 

ggaccacttg 

tctgcacctt 

cagcttccgt 

cagtgatgca 

gagggcaccc 

tccaggacgg 

tcatcagcaa 

gtgagataat 

atgatagttg 

gctgactagt 

gagtctcgct 

ccaccccccg 

cgccggccac 

gttagccagg 

tgctggggtt 

tctgtgcctt 

ttgaatatat 

catctttcca 

atagcacagt 

aacactgaac 

gcttaaattt 

aatcaagtgt 

ttcataatat 

tttgcaggct 

tcatgcctgt 

ttcaagacca 

ctggactttg 
tggtgcttgc
gggaagcgga 
gacagactcc 
agaaaatgtc 
caaagtgaat 
aagcatgttg 
tccgtcctag 
tagcatcttt 
catttccatt 
aattggattt 
taaatataat 
attaagtctc 
tatgtgtgcc 
tgctgaagag 
ggctgaggcg 
agacactatc 
ctgcagtccc 
gctgcagtga 
aaaaaaaaaa 
atacatcccc 
gtaaatagct 
taaaaaggag 
ctaagtgaaa 
aatcccagca 
agcttgggta 
ggtgacacat 
tattgcgcct 
gtgacagagt 
tgaaatatcc 
agagaaaggg 
aaatattttg 
actgaattgt 
tctctgaatg 
tagtattaga 
tttggcacat 
attcctagtc 
ttcaaaaggt 
aatatagtat 
acagcagatt 
gttaacagag 
acacagcaac 
attacagcag 
tccaaagata 
tgtacacatt 
tgtcagatag 
agtataagtt 
tttgttagag 
atagaatgcc 
agtctatttc 
ttttcttaat 
acaccaaaga 
gggaggctga 
gacaaaaccc 
tagtcccagc 
ctgcatgagc 
tcaaaaaaaa 
cagaggactg 
aggtttatat 
gaatcagtcc 
cagctttttc 



ctgtagtccc 
ggttgtaatg 
agctcaaaaa 
taggggaact 
gaggcagtgg 
catggtaatt 
gtactctcct 
ggggccctca 
atggctccca 
ctgctcggat 
gattctccag 
atttatatgt 
ttactggtaa 
tccctcttat 
ggaagatcac 
tctacaaaaa 
agatgcacag 
gctgtgatca 
gttgaagatt 
ctgaggagta 
aactgggtaa 
taaggtacag 
gaagccaaac 
ctttgagagg 
acatagcaag 
actaatatta 
gggagatcaa 
gagaccctgt 
aaaataggta 
attgggaagt 
gaactaaata 
tcactttcaa 
atattataaa 
agattcagta 
gaggtgctca 
atgaatcaag 
catggtctcc 
tcactcttct 
ttatataagc 
caagaataag 
tggcagcaga 
aacataatca 
gaaaacttgt 
tttatgtgtt 
atatttagta 
ataagagttt 
aaattagaaa 
attaataatt 
caaaatgtat 
tttgaattat 
gtagtttgaa 
ggtgggagga 
catctctaca 
tactcaagag 
catgattatg 
aaaaacgtct 
tactttatga 
ttcagatctg 
tgtggctgat 
ccttctcagg 



agctactcag 
agctgagatc 
caatacgttt 
cgaaggagat 
caggggggaa 
gtgacatttg 
ttttcttcat 
ttgctgtgtg 
tagcctttga 
tcatgtctat 
tagttatatt 
aaaacatatt 
catttttaat 
ccttactggc 
ctgagctcag 
ataaaaaata 
gaggctgagg 
taccactgta 
ctgaaagatt 
gttgatgaaa 
gcaaaatgtg 
aactgatgct 
ataaaaggtg 
ccaaggcagg 
accttgtctc 
gtgtagtccc 
ggctgcagag 
ctcaaaataa 
aattcataca 
aactacttaa 
gaagcagtgg 
atggttaatt 
gatgtcctat 
taaatatgat 
gtgaaggttt 
caagcagagt 
ttataggaga 
ttctatagcc 
agcactctag 
ggaggtaaca 
aagctgctgg 
tgagtgttca 
gcatttacag 
ggtacttaat 
cagtatattc 
aaaaacacta 
tgttgagtac 
cttatatact 
ttaacaacct 
gatgttaaat 
agctgggcat 
ttgctttagc 
aaaaatataa 
gctgagctgg 
ctaccgcact 
acctttctat 
caacgttaaa 
tttcctaaaa 
ctagaccccc 
gttacttcgt 



gaggcttagg 

acaccactgc 

ttttaatctt 

gattatagga 

agggggaacc 

cttcctgcga 

tggtctgatg 

cctcatcatg 

gttagttggt 

acccatgact 

aattcatagt 

acaggttatt 

aagactagct 

agctaaaaca 

gagcttgaga

aaaaaattag 

tggggggatt 

ccccagccta 

tctaaatatg 

aataacatga 

gaatatctac 

accaggtgaa 

acgtggctgg 

agaatcattt 

tactaaaaat 

acctactcca 

ccgtgatagc 

aaggtcatat 

gatagaaaac 

ttggaatgaa 

gtgcaaaaca

ctatttcatg 

gttcagtatt 

aaaatttaaa 

agctcaattc 

tactactgag 

accctccatt 

tacaagctta 

tagtaaatag 

ggaaacttct 

ctcaccagga 

taaggaggaa 

atattgtcta 

tggacaaatg 

cctggacatt 

agataacttt 

cgtaaaagtt 

tacattttct 

aactttcatc 

agttttcatg 

ggtggctcat 

ctaggagctt 

aaattagcca

gaggatcaat 

ccagcctggg 

cttttcaagt 

gatatgagat 

aaaaaaaaaa 

tacagagcca

gacaaactgg 



cagaagaatt 
actccagcct 
gtcccttaat 
gttgattatg 
agtaatgact 
gcggagctga 
tccatgatga 
ggtctcttcg 
gcccaggatg
gttggcccac 
attttctact 
gattactggt 
attaaacagt 
ctatataaaa 
ccagcctgga 
ccaggcgtgg 
gcttgagccc 
ggcgaaaccc 
tcggcttttg
tagaaaaaac 
agtgcaatat 
tggacctcaa 
gcatggtggc 
gagctcagga 
aaaaaaaatt 
gaggcggagg
accactgcac 
atcgtatgat
tggtgactgc 
gtgtcctttt 
ttgtgtatgt 
tgaattccac 
acttgggtgc 
tgtgtgtact 
attctcatac 
cagaccaaat 
ccttcttctc 
gttttatagt 
ttttgctacc
aaagcaggca 
ggctctgagg 
gagttcagga 
ttttatatat 
gtcagctatt 
ttagataact 
taaaaccatg 
ttcagaagca 
tccaggtact 
caaattatta 
ttatttcata 
gcctgtaatc
gagaccagcc 
ggcatggtag 
tgagcttggg 
tgacacagtg 
aatgtcttgg 
ctttttcttc 
gtctgatgtc 
cacttgttct 
gctcctatga 



gcttgaacct 
gggcaacaga 
ggaaatattg 
tatgtaaaat 
tagaagtcct 
ccttgtggtg 
ttcctctgtg 
atggatgctt 
tctcccaagc 
ccattgcagg
tcaggtctta 
cttttgcttt
atgttgaatt 
cactttggga 
caacatagtg 
tggcatgcat
agaggtcaag 
tatctttaaa 
gtaagagatc 
ccaaatgcct 
tattcagcca
gaacattatg 
tcacacctgt 
gttccagact 
agcctggcat
cgagaggatc 
tccagcctgg 
ttatttcatg 
caggggctgg 
ggcatgatga 
gctaaatgcc
ctctagaaat 
ctgtgtcaat 
gtcatctgtg 
cctgaactgt 
ccagtttaag 
tcccgcaggt 
ttctaagtcc 
agaagtgaca 
gaacaccacc 
gcatcatagt 
atggtctgat 
gcagaatgga 
tatttaacaa 
taggttttat 
agtcctggaa 
gaacccgaat 
tataaacctt 
aatttaaaat 
aaaaacaaat 
tcagcatttt 

tgggtaatgt 

catgcacctg 
aagtcaaggc 
agaccctgtc 
tatgaaaatc 
ctatcaaaaa 
tgagatgtct 
cccttggtga 
tgtggcattc 
tacctcgctg
catagtaaga 
ttggaaaacc 
tctattattt 
aagtttcctt 
atagcacaat 
atatctctga 
aaagtcacat 
aatgatcttt 
gtttggttag 
ttttactttc 
taccagatct 
tatgatatta 
aacatagtgt 
ctgttcagaa 
tcaagtagcc 
atagaatgca 
tgttggcact 
tacaggcaca 
tgaagaaaat 
gtctgaaaag 
ggcaaaaatt 
tttatataca 
catgccatga 
tgctttagta 
ttactttgtc 
aggggactta 
gaccatacta 
tgtccagtta 
aagaccaatc 
gcctataatc 
gagaccagcc 
ggccaagcat 
tcacttgagc 
aaaaatttaa 
ctgaggtagt 
tgtactctag 
aataaatgtt 
caattttaca 
ttttcttttc 
ttgcttggtt 
catatgtgtt 
actggatcta 
tataataata 
catatttagt 
atgaaaaata 
ttctgaatga 
caatccagac 
tcctcatcct 
gaggccaaaa 
actattgtaa 
ggatattgct 
atgcaagatg 
aagacttaaa 



gagtccctcc 
agcaaagaga 
agaactctct 
aatatcttac 
tccttttata 
aattgggaaa 
tgtttccatg 
attgtgaaaa 
agcttgtaca 
gagtggaaat 
ttatgctctt 
acctaaacct 
agatattctg 
tcagaatcaa 
atcatttaag 
agaaacaccc 
tgaaagatat 
agaacctgat 
aattggtgga 
atctgtagct 
gttaaaagca 
tgcactcata 
gattatcata 
cccatactca 
cttgaggcct 
atacatgcag 
gataatttgc 
ccctttcatg 
tataaaacgt 
atcaaaccca 
ccagcacttt 
tgggcaacat 
agtggctcat 
ccagaagttc 
aaattagcca 
ggaaggattg 
cctgttcaac 
aaattttgtt 
gaaagtaatt 
aaaatttggc 
agtttccata 
tgtatattaa 
taacactgaa 
taaaattaag 
actatgtttt 
ctaatataaa 
agtgttactg 
acttttctga 
gtttttgata 
aaagcatgaa 
cagatggcca 
gggatccatc 
tgttcatctc 
att 



ccttattgga 
gatcagtaaa 
gctgtcaagt 
atacctccac 
caaattgcaa 
tagaaccctt 
agtctgaggg 
tttgaagcta 
aatgtttaaa 
attttgtttg 
tggaaacttt 
caagtctatg 
acttgctcca 
tatgtgagat 
tttacagcgt 
cacgtttctg 
tctggcgatt 
atttaaacaa 
ggctggagga 
gatagaaata 
cttataaaaa 
ttatttatcc 
agaaagctct 
tttacaaaaa 
ttaaaagtta 
ttgcacctta 
ctgtggatgg 
ttcttattat 
tactttctca 
ctattataaa 
gggaggctga 
agcaagatcc 
gcctgtaatc 
aagaccagct 
ggcatgatgg 
cttgagccta 
tgagcagaac 
ttaagtttta 
caaaaatgaa 
tgattttagg 
ttaaaacagc 
acttcacata 
aaagttctat 